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method can be used i.a. for optimizing the formation of biomass or a desired product by a cell by expressing different levels of uncoupled 
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A method of improving the production of biomass or a de- 
sired pr oduct from a cell 

This invention relates to a method of improving the pro- 
5 duction of biomass or a desired product from a cell by 
inducing conversion of ATP to ADP without primary effects 
on other cellular metabolites or functions. The invention 
also relates to a method of optimizing the production of 
biomass or a desired product from a cell utilizing this 
10 first method. The desired product may for example be lac- 
tic acid produced by lactic acid bacteria and ethanol or 
carbondioxide produced by yeast. 

BACKGROUND OF THE INVENTION 

15 

A wide range of microorganisms are used for the produc- 
tion of various organic compounds and heterologous pro- 
teins. One example hereof is the production of lactic 
acid and other organic compounds by the lactic acid group 
20 of bacteria, which results in the acidification and fla- 
vouring of dairy products, better known as cheese and 
yougurt production . 

From the microorganism's point of view, the organic com- 
25 pounds which are excreted from the cells are often merely 
the by-product of a process that is vital to the cells: 
the production of various forms of free energy (ATP, 
NAD ( P ) H , membrane potential, etc.). Therefore, although 
many of the microorganisms which are being employed in 
30 these processes are reasonably well suited for the pur- 
pose, there is still a great potential for optimizing the 
productivity of these organisms when looking from the 
bioreactor point of vue. Likewise, the production of het- 
erologous proteins by a microorganism is not what the or- 
35 ganism was adapted for and also here there is a potential 
for optimization. 



WO 98/10089 



2 



PCT/DK97/GG373 



Often when microorganisms are engineered for the purpose 
of optimizing an industrial production process, the reac- 
tions leading to the desired product will affect the 
delicate balance of co-factors involved in the energy me- 
5 tabolism of the cell. For instance if the glycolytic re- 
actions producing lactate from sugar were somehow to be 
enhanced (e.g. by overexpressing the glycolytic enzymes) 
this would automatically lead to the convertion of ADP to 
ATP. The ratio between the concentrations of ATP and ADP 

10 is usually quite high in the growing cell ( [ATP] /[ADP] > 
10), and when the ratio [ATP]/[ADP] changes, the sum of 
[ATP] and [ADP] still remains virtually constant. There- 
fore, if in the example above, the enhanced production of 
ATP changes the [ATP] /[ADP] ratio from 10 to say 30, this 

15 will only marginally affect the concentration of ATP. The 
ADP concentration however will change by a factor of 
three. The cells will then hardly feel the surplus of ATP 
but the ADP pool in the cells may be depleted to such an 
extent that reactions in which ADP is a co-factor or al- 

20 losteric regulator will be suppressed by the lack of ADP. 
The result may be that the total flux through the pathway 
(here through glycolysis) is only marginally increased. 
In the future, this situation is likely to occur more 
frequently, as the productivity of bioreactors are opti- 

25 mized by other means, and in these cases, it will be even 
more important ( compared to the normal cell ) to regener- 
ate the ADP from ATP, in order to further increase the 
productivity . 

30 Previously, attempts have been made to decrease the in- 
tracellular ATP concentration in yeast, employing sets of 
reactions which together form futile cycles, see EP pat- 
ent No. 245 481. Often, the first reaction of a futile 
cycle is part of the regular metabolic network of the 

35 cell, for instance the phosphorylation of a glycolytic 
intermediate, coupled to the utilisation of ATP. The sec- 
ond reaction, which may also sometimes be part of the 
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metabolic network, then de-phosphorylates the glycolytic 
intermediate without regenerating the ATP that was con- 
sumed in the first process, the overall effect being that 
a high energy phosphate bond is consumed. The limited 
5 success that this strategy has had so far, is probably 
due to the fact that it is impossible to obtain a sig- 
nificant futile flux without decreasing the concentration 
of the phosphorylated intermediate, thereby disturbing 
the cellular function and ultimately the growth. In addi- 
10 tion, when the approach is to decrease the concentration 
of a glycolytic intermediate, this will effectively re- 
move the substrate for the remaining part of the glycoly- 
sis, which will often result in a decreased flux through 
this pathway, rather than the desired increased flux. 

15 

Other strategies have been to use chemicals such as dini- 
trophosphate to stimulate the activity of the plasma mem- 
brane H + -ATPase by the addition of uncouplers of the mem- 
brane potential, or to genetically express the enzyme 

20 acid phosphatase in the cytoplasm, an enzyme that will 
remove phosphate groups from organic metabolites and pro- 
teins. However, both of these approaches suffer from the 
same inherent problem: they are unspecific and a range of 
cellular reactions/concentrations may be affected. For 

25 instance, the acid phosphatase will remove phosphate 
groups from essential metabolites and proteins, thus dis- 
turbing various metabolic fluxes and metabolic regula- 
tion. The uncoupling of the plasma membrane H + -ATPase 
will disturb the intracellular pH in addition to the gra- 

30 dient of numerous ions across the cytoplasmic membrane. 
Besides, the addition of chemicals such as dinitrophos- 
phate is undesirable for most purposes . 

SUMMARY OF THE INVENTION 

35 

The idea of the invention is to use a highly specific and 
clean way to increase the intracellular level of ADP, 
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which does not suffer from the limitations described 
above: to express in a well-controlled manner an enzyme 
that has ATP-hydrolytic activity in the living cell with- 
out producing other products and without coupling this 
5 activity to energy conservation. Such an enzymatic activ- 
ity is of course not likely to be found in a normal cell, 
because the cell would then loose some of its vital en- 
ergy reservoir. 

10 Accordingly the present invention provides a method of 
improving the production of biomass or a desired product 
from a cell, the method being characterized by expressing 
an uncoupled ATPase activity in said cell to induce con- 
version of ATP to ADP without primary effects on other 

15 cellular metabolites or functions, and incubating the 
cell with a suitable substrate to produce said biomass or 
product. 

One of the normal enzymes that comes closest to the 
20 ideal ATP-hydrolyzing enzyme, is the membrane bound H + - 
ATFase. This huge enzyme complex consists of two parts , 
the membrane integral part (Fq) and the cytoplasmic part 
(Fi). Together the two parts couples the hydrolysis of 
ATP to ADP and inorganic phosphate (Pi), to translocation 
25 of protons accross the cytoplasmic membrane, or vice 
versa, using the proton gradient to drive ATP synthesis 
from ADP and Pi . 

The method of the invention is conveniently carried out 
30 by expressing in said cell the soluble part (Fi) of the 
membrane bound (FqFi type) H + -ATPase or a portion of the 
Fi exhibiting ATPase activity. 

The membrane bound H + -ATPase complex is found in similar 
35 form in prokaryotic as well as eukaryotic organisms, and 
thus Fi and portions thereof expressing ATPase activity 
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can be expressed in both prokaryotic and eukaryotic 
cells . 

The organism from which the Fl ATPase or portions thereof 
5 is derived, or in which the Fl ATPase or portions thereof 
is expressed, may be selected from prokaryotes and eu- 
karyotes , in particular from bacteria and eukaryotic mi- 
croorganisms such as yeasts, other fungi and cell lines 
of higher organisms, in particular bakers and brewers 
10 yeast, 

A particularly interesting group of prokaryotes to which 
the method according to the invention can be implemented, 
i.a. in the dairy industry, are lactic acid bacteria of 
15 the genera Lactococcus , Streptococcus, Enterococcus , Lac- 
tobacillus and Leuconostoc , in particular strains of the 
species Lactococcus lactls and Streptococcus thermophi- 
ius. Other interesting prokaryotes are bacteria belonging 
to the genera Escherichia, Zymomonas, Bacillus and Pseu- 
domonas, in particular the species Escherichia coli f Zy- 
momonas mob i lis , Bacillus subtilis and Pseudomonas pu- 
tida. 

In an expedient manner of carrying out the method accord- 
ing to the invention the cell is transformed or trans- 
fected with an expression vector including DNA encoding 
Fi or a portion thereof exhibiting ATPase activity under 
the control of a promoter functioning in said cell, and 
said DNA is expressed in the cell. Said DNA encoding Fi 
or a portion thereof may be derived from a prokaryotic or 
a eukaryotic organism, and it may be either homologous or 
heterologous to said cell. 

The Fi part of the bacterial H + -ATPase complex consists 
of several subunits that together are responsible for 
catalyzing ATP hydrolysis: the P-subunit is thought to 
carry the actual hydrolytic site for ATP hydrolysis, but 
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in vitro ATPase activity requires that the P-subunit 
forms a complex together with the a- and y-subunit 
(aSYPs)* The activity of this complex is modulated by the 
e-subunit, so that the in vitro activity of the a 37^36 
5 complex is five fold less than the 013YP3 complex • 

In a specific embodiment of the method according to the 
invention said DNA encoding Fi or a catalytic active por- 
tion thereof, is derived from Escherichia coli, Strepto- 

10 coccus thermophi 1 us or Lactococcus lactis and is selected 
from the group consisting of the gene encoding the Fi 
subunit p or a catalytically active portion thereof and 
various combinations of said gene or portion with the 
genes encoding the Fi subunits 5, a, y and e or catalyti- 

15 cally active portions thereof. 

In particular said DNA encoding Fi or a portion thereof 
may be selected from the group consisting of the Es- 
cherichia coli r Streptococcus thermophi lus and Lactococ- 
20 cus lactis genes atpHAGDC (coding for subunits 8, a, y, 
P, e), atpAGDC (coding for subunits a, y, p, e) , atpAGD 
(coding for subunits a, y f p) , atpDC (coding for subunits 
P, e) and atpD (coding for subunit P alone) . 

25 Particularly interesting eukaryotes are the yeasts Sac- 
charomyces cerevisiae, Phaffia rhodozyma or Trichoderma 
re&sei, and the DNA encoding Fi or a portion thereof may 
be derived from such organisms and is selected from the 
group consisting of the gene encoding the Fi subunit P or 

30 a portion thereof and various combinations of said gene 
or portion with the genes encoding the other Fi subunits 
or portions thereof. 

Vectors including DNA encoding the soluble part (Fi) of 
35 the membrane bound (F0F1 type) H + -ATPase or a portion of 
Fi exhibiting ATPase activity, derived from the lactic 
acid bacteria Lactococcus lactis and Streptococcus t/ier- 
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mophilus and from the yeasts Saccharomyces cerevlslae, 
Phaffia rhodozyma or Trichoderma reesel are also com- 
prised by the invention as well as expression vectors in- 
cluding such DNA under the control of a promoter capable 
5 of directing the expression of said DNA in a prokaryotic 
or eukaryotic cell . 

Specific vectors according to the invention are plasmids 
including DNA encoding the soluble part (Fi) of the mero- 

10 brane bound (F()Fi type) H + ~ATPase or a portion of Fj ex- 
hibiting ATPase activity, said DNA being derived from 
Lactococcus lactls subsp. cremoris (SEQ ID No. 1), Lacto- 
coccus lactls subsp. lactls (SEQ ID No. 6), Streptococcus 
thermophllus (SEQ ID No. 10), Phaffia rhodozyma (SEQ ID 

15 No. 14), and Trichoderma reesei (SEQ ID No. 16). 

Further, the invention provides a method of optimizing 
the formation of biomass or a desired product by a cell, 
the method being characterized by expressing different 

20 levels of uncoupled ATPase activity in the cell, incubat- 
ing the cell on a suitable substrate, measuring the con- 
version rate of substrate into biomass or the desired 
product at each level of ATPase expression, and choosing 
a level of ATPase expression at which the conversion rate 

25 is optimized. 

Often, but not always, the optimization of a given prod- 
uct flux produced by a cell will entail the attainment of 
either maximum or minimum conversion rate of a substrate. 

30 

In an expedient manner of practicing this method of the 
invention a number of specimens of said cell are trans- 
formed or transfected with their respective expression 
vector each including DNA encoding a different portion of 
35 the cytoplasmic part (Fi) of the membrane bound (F()Fi 
type) H + -ATPase up to and including the entire Fi, each 
portion exhibiting ATPase activity, said DNA in each ex- 
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pression vector being under the control of a promoter 
functioning in said cell, incubating each cell specimen 
on a suitable substrate, measuring the conversion rate of 
substrate into biomass or the desired product in each 

5 specimen, and choosing a specimen yielding an optimal 
conversion rate. In a particular embodiment of this man- 
ner, which is especially suited for scientific studies, 
the promoter in each expression vector is an inducible 
promoter, and each cell specimen is grown at different 

10 concentrations of inducer in order to fine-tune the opti- 
mal conversion rate. 

In a preferred manner of practicing the above method of 
optimizing the performance of a cell a number of speci- 

15 mens of said cell are transformed or transfected with 
their respective expression vector including DNA encoding 
a portion of the cytoplasmic part (Fi) of the membrane 
bound (FqFi type) H + -ATPase up to and including the en- 
tire Fi, said portion exhibiting ATPase activity , said 

20 DNA in the respective expression vectors being under the 
control of each of a series of promoters covering a broad 
range of promoter activities and functioning in said 
cell, incubating each cell specimen on a suitable sub- 
strate, measuring the conversion rate of substrate into 

25 biomass or the desired product by each specimen, and 
choosing a specimen yielding an optimal conversion rate. 
In a more preferred embodiment of this manner, which is 
well suited to establish an optimal production strain, 
the respective expression vectors include DNA encoding 

30 different such portions of Fi up to and including the en- 
tire Fi, each DNA in respective expression vectors being 
under the control of each of a series of promoters cover- 
ing a broad range of promoter activities and functioning 
in said cell. 

35 

Also in this method of the invention the DNA encoding a 
portion of Fi up to and including the entire Fi may be 
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derived from a prokaryotic or a eukaryotic organism, and 
it may be either homologous or heterologous to said or- 
ganism. The specific DNAs mentioned above may also con- 
veniently be employed in this method. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1- A linear representation of the plasmids con- 
structed for modulating the cellular [ATP)/[ADP] ratio in 
10 E. coli (not drawn to scale) . 



Figure 2 • Effect of induction of Fi-ATPase activity on 
the growth of E. coll in batch culture. Cells were grown 
for more than 10 generations in minimal medium supple- 
15 mented with glucose (0.4 g/1), ampicillin (0.1 g/1) and 
the indicated concentration of inducer, IPTG. 



Figure 3. Effect of ATPase expression on the intracellu- 
lar concentration of ATP and ADP (concentration in arbi- 
20 trary units), and on the ratio [ATP]/[ADP]. 

Figure 4 Effect of increased ATPase expression on the 
glycolytic flux. 



25 DETAILED DESCRIPTION OF THE INVENTION 



Many biosynthetic reactions in the living cell (anabo- 
lism) , require an input of free energy (ATP), which is 
generated through a series of degrading reactions (cata- 

30 holism). In the aerobic cell, there are two routes for 
ATP synthesis: 1) substrate level phosphorylation, where 
an energy rich phosphoryl group is transferred directly 
from a high energy intermediate metabolite to ADP, and 2) 
oxidative phosphorylation, where the free energy is first 

35 transformed into redox free energy by oxidizing the en- 
ergy source, then into a proton gradient by respiration 
and finally the proton gradient is used by the H + -ATPase 
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to drive ATP synthesis from ADP and inorganic phosphate. 
In other cases, e.g. anaerobic growth, there is only the 
first route, substrate level phosphorylation, that can be 
used for ATP synthesis. An example hereof is the homo lac - 

5 tic LAB, where lactose is converted through the glyco- 
lytic pathway to lactic acid, which is excreted from the 
cells and thereby lowers the pH of the growth medium 
(usually milk products). With respect to ATP generation, 
homolactic fermentation is a very inefficient process, 

10 and only four moles of ATP are produced from 1 mole of 
lactose through substrate level phosphorylation. 

The anabolic (ATP consuming) and catabolic (ATP produc- 
ing) fluxes are normally well balanced in the living 

15 cell, and therefore, in the wild-type cell under normal 
growth conditions, the catabolic fluxes will be propor- 
tional to the anabolic fluxes. If a reaction is intro- 
duced that for instance hydrolyzes ATP in the cell and 
thereby lowers the cellular energy state (i.e. the 

20 [ATP]/[ADP] ratio), then either catabolism should in- 
crease or anabolism (growth) should decrease in order to 
make the consumption rate equal the production rate 
again. Which of these two scenarios will take place de- 
pends on whether, initially, the growth rate of the cell 

25 is limited through anabolism or through catabolism, i.e. 
whether there is a surplus or a shortage of energy in the 
cell to begin with. If there is a shortage of energy, 
then the rate of the anabolic reactions is limited by ca- 
tabolism and these reactions will be sensitive to changes 

30 in the cellular energy state. Introduction of an ATP- 
hydrolyzing reaction is then most likely to affect the 
growth rate of the cells. On the other hand, if there is 
a surplus of energy, then the growth rate will be limited 
mainly by the anabolic reactions; the rate of anabolism 

35 will be insensitive to a decrease in the energy state, 
but the catabolic rate may increase due to a decrease in 
product inhibition at lower [ATP] /[ADP] ratio. 
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In vitro, the Fi part of the H -ATPase complex has been 
shown to have ATPase actitity, see above. But so far no- 
body has managed to use the Fi complex to stimulate the 
glycolytic flux, or even to show that the Fi complex can 

5 hydrolyze ATP in intact ceils. Indeed, when we first 
tried to overexpress the FI complex, consisting of the 
genes for the subunits a g y, P and e, this had virtually 
no effect on the growth of E. coll, even when the genes 
were transcribed from the maximally induced tac promoter 

10 and on a very high copy number vector (derived from 
pUC18). One skilled in the art of gene expression in E. 
coll will appreciate that this combination is one of the 
most efficient expression systems that exists for this 
organism. 

15 

We then decided to try to express different combinations 
of subunits of the FI complex, in order to see if other 
combinations of subunits would be more powerful . Plasmids 
were constructed containing various combinations of the 

20 genes encoding the Fi part of the bacterial FiFo -ATPase 
complex from E. coll. The genes were expressed, either 
from an inducible (iac-type) promoter at various concen- 
trations of inducer or from a series of constitutive pro- 
moters of varying promoter activity. These plasmids 

25 should express various levels of ATPase activity when in- 
troduced into the bacterial cell . Depending on which Fi 
genes are present on the plasmid and the strength of the 
promoter which is used to drive the expression, we ob- 
served various degrees of inhibition of the growth of the 

30 cells harbouring these plasmids. Surprisingly, the beta 
subunit alone and in combination with the epsilon subunit 
turned out to be far more active in vivo than the entire 
FI complex. 

35 The objective of this work was to affect the energy state 
of the cells, as reflected in the ratio [ATP]/[ADP]. We 
therefore measured the intracellular concentration of ATP 
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and ADP in growing cells expressing various activities of 
Fi-ATPase. Indeed the ATP concentration decreased 
slightly with increasing ATPase activity and the ADP con- 
centration increased, and therefore the [ATP] /[ADP] ratio 

5 decreased (the effect on the ATP concentration was less 
than the effect on the ADP concentration as expected, see 
above) • We also calculated the glycolytic flux through 
the cells with various levels of ATPase activity. We 
found that the flux through the glycolytic pathway was 

10 first stimulated with increasing expression of ATPase ac- 
tivity, until a certain (optimal) ATPase activity which 
gave maximal glycolytic flux. Further increase of ATPase 
expression resulted in a lower glycolytic flux, due to a 
secondary effect of the ATPase activity on the growth of 

15 the cells. This emphasizes the need for optimization of 
gene expression rather than merely overexpressing the 
genes . 

EXAMPLE 1 

20 

ATP hydrolysis and enhanced glycolytic flux in Escheri- 
chia coli, using an inducible promoter 

Restriction enzymes, T4 DNA polymerase, calf intestine 
25 phosphatase (CIP) were obtained from Pharmacia. 

Procedures for DNA isolation, cutting with restriction 
enzymes, filling in sticky DNA ends with T4 DNA po- 
lymerase in the presence of dATP, dCTP, dGTP and dTTP, 
30 treatment with calf intestine phophatase to remove phos- 
phate groups from 5' DNA ends and ligation of DNA frag- 
ments are carried out by standard methods as described by 
Maniatis et al . , 1982. 
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Extraction and measurement of ATP and ADP 

0.9 ml of cell culture was mixed with 0.9 ml of (80 °C) 
phenol (equilibrated with 10 mM Tris, 1 mM EDTA pH=8) and 

5 immediately vortexed vigorously for 10 seconds . After 1 
hour at room temperature the sample was vortexed again 
for 10 seconds and the two phases were separated by cen- 
trifugation at 14000 rpm for 15 minutes, and then resid- 
ual phenol in the water phase was removed by extraction 

10 with 1 volume of chloroform, ATP and ADP concentrations 
were then measured, using a lucif erin-lucif erase ATP 
monitoring kit (obtained from and used as recommended by 
LKB, except that 3 mM of phosphoenol-pyruvate was added). 
[ATP] was measured first. Subsequently the ADP in the 

15 same sample was converted to ATP by adding pyruvate 
kinase, and [ADP] was recorded as the concomitant in- 
crease in luminescence. 

Construction of plasmids carrying combinations of the E. 
20 coli atp genes 

The following combinations of E. coli genes coding for Fi 
subunits were chosen for expressing ATPase activity in E - 
colli 1. atpAGDC (subunits a, y, P, e), 2. atpAGD (sub- 
25 units a, y, p), 3. atpDC (subunits p, e) , and 4. atpD 
(subunit P alone) • 

Cloning of fragments carrying atp genes onto pUC19 

30 The plasmid pBJC917 (von Meyenburg, K. , et al . , 1984) 
which carries the entire atp operon was cut with 

1) the restriction enzyme Dralll, and a 5009 bp DNA frag- 
ment containing the atpAGDC genes was isolated; 

35 

2) the restriction enzymes Drain and Tthllll, and a 4106 
bp DNA fragment containing the atpAGD genes was isolated; 
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3) the restriction enzymes Drain and SacII, and a 2364 
bp DNA fragment containing the atpDC genes was isolated; 

4) the restriction enzymes Aval and Tthllll, and a 1472 
5 bp DNA fragment containing the atpD gene was isolated. 

In all four cases the fragments were then treated with T4 
DNA polymerase to create blunt ends, and subsequently the 
fragments were ligated into the cloning vector pUC19 
10 (Yanisch-Perron et al. , 1985) which had been cut with Smal 
and treated with CIP. 

The four ligation mixtures were transformed into the E. 
coll strain JM105 (Yanisch-Perron et ai., 1985), and the 

15 transformation mixtures were plated on LB ( Luria-Bertani 
broth; Maniatis et al. f 1982) plates containing 100 ug/ml 
ampicillin and 75 ug/ml 5-bromo-4-chloro-3-indolyl-p-D- 
galactoside (X-gal). In this strain background (JM105), 
plasmids formed by religation of pUC19 will give blue 

20 colonies, whereas plasmids that carry foreign DNA frag- 
ments inserted into the Smal site of pUC19, will give 
white colonies. A number of white colonies from the four 
transformations were therefore picked for further analy- 
sis: plasmid DNA was isolated and analysed by cutting 

25 with various restriction enzymes. Clones were identified 
from each of the four series which had the desired frag- 
ment inserted into the Smal site of pUC19, and in the 
proper orientation. These four plasmids were named, re- 
spectively: pATP-AGDC, pATP-AGD, pATP-DC and pATP-D, with 

30 reference to the specific atp genes carried by the plas- 
mid. 

Cloning combinations of the atp genes under the control 
of an inducible ( tac) promoter 

35 

In order to be able to control the expression of the ATP- 
ase activity, we selected the expression vector pTTQIB 
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(Starck, 1987). This vector is a derivative of pUC18 
(Yanisch-Perron et ai.,1985), which carries a tac pro- 
moter and the lactose repressor gene, lacl. Immediately 
downstream of the tac promoter is a multiple cloning site 
5 (MCS; the polylinker from pUC18) in which genes can be 
inserted to be expressed from the tac promoter. The tac 
promoter is of the lac- type , i.e. repressed by the lac- 
tose repressor and inducible with isopropyl-0-D- 
thiogalactoside (IPTG) . 

10 

The four plasmids, pATP-AGDC, pATP-AGD, pATP-DC and pATP- 
D were cut with Kpnl and Xbal , which gave the four DNA 
fragments, 5023, 4120, 2378 and 1486 respectively. After 
purification, the fragments were ligated into the cloning 

15 vector, pTTQ18, which had also been cut with Kpnl and 
Xbal (see figure 1). The ligation mixtures were trans- 
formed into E. coll K-12 MC1000 (Casabadan and Cohen, 
1980), and the transformation mixtures were plated on LB 
plates containing 100 ng/ml ampicillin. A number of colo- 

20 nies from the four transformations were therefore picked 
for further analysis: plasmid DNA was isolated and ana- 
lysed by cutting with various restriction enzymes. Clones 
were identified from each of the four series which had 
the desired fragment inserted into the MCS of pTTQ18 in 

25 the proper orientation. These four plasmids were named, 
respectively: pTAC-AGDC, pTAC-AGD, pTAC-DC and pTAC-D, 
with reference to the specific atp genes carried by these 
plasmid and the tac promoter used for their expression. 
For the purpose of subsequent physiological studies, the 

30 plasmids were transformed into the E. coli K-12 strain 
LM3118, which is used routinely for physiological experi- 
ments in this laboratory. The corresponding names for the 
LM3118 strain carrying these four plasmids are PJ4 332, 
PJ4333, PJ4335 and PJ4334, respectively. 



35 
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Effect of induction of ATPase activity on the growth of 
E. coli on plates 

The strains containing the four plasmids were streaked on 

5 LB plates containing ampiciliin {100 ug/ml) and 1 mM of 
IPTG which should give maximum expression from the tac 
promoter. Table I shows how the four strains responded: 
the strain carrying plasmid pATP-AGDC, which contains the 
genes for the four subunits, a, y, P and e, was only very 

10 slightly affected in growth , even in the presence of 1 mM 
IPTG. The other three plasmids, pTAC-AGD, pTAC-DC and 
pTAC-D caused severe growth inhibition in the presence of 
1 mM IPTG, where colonies were no longer visible . With 
intermediate concentrations of IPTG, 0.01 mM and 0-1 mM, 

15 the plasmids affected the growth of their host cells to 
different extents: pTAC-AGD was the most active, giving 
rise to a strong inhibition of growth already with 0.01 
mM IPTG, a concentration which gave only a slight inhibi- 
tion with the plasmid pTAC-DC and no inhibition of the 

20 strain with pTAC-D. With 0.1 mM IPTG, colonies were 
hardly visible for the strain that carried the pTAC-AGD, 
the plasmid pTAC-DC caused strong growth inhibition, 
whereas the effect of pTAC-D was significant but small. 

Table I 

Strain Plasmid - IPTG 0.01 mM IPTG 0 . 1 mM IPTG 1 mM IPTG 



PJ4 332 pTAC-AGDC ++++ ++++ ++++ +++ 

PJ4333 pTAC-AGD ++++ ++ + 

PJ4335 pTAC-DC ++++ +++ + 

PJ4334 pTAC-D ++++ ++++ ++ 

++++ ca normal colony size; +++ c slight inhibition; ++ = 1/2 normal size; 
+ = 1/10 normal size; - = no growth 



The effect of ATPase expression from the four plasmids 
above was also studied in the E . coli mutant LM3115, in 
which the entire atp operon on the chromosome is deleted, 
5 but which grows with almost wild- type growth rate on LB 
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medium. With this strain transformed with the four plas- 
mids we observed a similar pattern of growth inhibition 
on LB plates as a function of IPTG concentration. This 
shows that the effect of ATPase expression was independ- 
5 ent of the presence of the normal atp operon. 

Effect of induction of ATPase activity on the growth of 
E* coll in liquid cultures 

10 The effect of induction of ATPase was also studied with 
cells grown in liquid cultures. For this purpose we chose 
the strain PJ4333, carrying the plasmid pTAC-AGD, because 
this plasmid appears to be the most active with respect 
to the inhibitory effect on the of growth of E. coll. 

15 Figure 2 shows the growth of PJ4333 in minimal medium 
supplemented with a limiting concentration of glucose 
(0.4 g/1) and ampicillin (0.1 g/l), without IPTG and in 
the presence of increasing concentrations of IPTG. We ob- 
served that the growth rate of the strain was practically 

20 constant (within some 10%) with increasing amounts of 
IPTG up to about 30 ^M. At higher than 40 \m IPTG, the 
growth of the cells were slightly inhibited, in accor- 
dance with the experiments on plates, see above. 

25 However, what was affected was the final density of cells 
that one obtains from the limited amount of glucose that 
was included in each culture: The more ATPase that is ex- 
pressed in the cells, the lower the yield of cell mass. 
Apparently, the cells become less economic with respect 

30 to converting the glucose into biomass, or in other words 
they consume more glucose per cell synthesized. If this 
is due to the expression of ATPase activity, then we 
would expect to see an effect hereof on the energy state 
of the cells. We therefore measured the concentrations of 

35 ATP and ADP in the cells growing with different expres- 
sion levels of ATPase activity. 
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Indeed, the intracellular ATP concentration decreased 
gradually and the ADP concentration increased, with in- 
creased expression of ATPase; therefore the [ATP] /[ADP] 
ratio decreased with increased expression of ATPase, 

5 which imply that the increased glucose consumption is the 
result of increased ATP convertion to ADP, see figure 3 . 
The actual flux of glucose through the cells (Jgiuc# mmol 
glucose / g cell dry weight / hour) is also interesting, 
because this value tells us whether the performance of 

10 the cell increased as the ATPase activity increased* 
Jgluc can be calculated from the yield, Y (g cell dry 
weight / mol glucose) and the specific growth rate of the 
culture, ix ( 1 /hours) : 

15 Jgluc = H/Y 

Figure 4 shows how the flux of glucose changed as the ac- 
tivity of ATPase increased: the glycolytic flux increased 
gradually as the ATPase expression increased, until a 

20 maximum was reached (at 30 pM IPTG) . Further increase of 
ATPase expression had a slightly negative effect on the 
glucose flux. This was probably because the energy state 
of the cells became so low that this had a negative ef- 
fect on some anabolic reactions, since the growth rate 

25 was lower for the culture that was grown in the presence 
of 40 MM IPTG. 

The expression of subunits of the Fi part of the bacte- 
rial H + -ATPase lowers the energy state of the bacterial 

30 cell. This is due to hydrolysis of ATP into ADP and Pi . 
The expression of ATPase activity does not affect the 
growth rate of E* coll much at low levels of expression, 
but the efficiency by which the substrate is converted 
into biomass was strongly reduced. Under the set of con- 

35 ditions used here, the expression of ATPase activity has 
a stimulatory effect on the rate by which the cells con- 
sumes the exogenous glucose . 
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EXAMPLE 2 

Expression of Fi-ATFase activity from constitutive pro- 
moters in E* coli 

5 

In example 1 we used a lac-type promoter system to modu- 
late the expression of the Fi ATPase subunits in E. coli. 
However, for the optimization of gene expression for in- 
stance in industrial bioreactors or for the use in fer- 

10 merited food products, the use of lac type promoters is 
not always feasible. In this example we illustate the op- 
timization of Fi-ATPase expression in E* coli, using a 
series of constitutive promoters of different strength, 
to control the expression of the atpAGD genes which here 

15 originates from E. coli. The constitutive promoters (CP 
promoters) were selected from a library of artificial 
promoters which had previously been cloned onto a shuttle 
vector for E. coli and L. lactis , pAK80 (Israelsen et 
ai., 1995) as described in our co-pending PCT patent ap- 

20 plication PCT/DK97/00342 . The selected plasmid deriva- 
tives of pAK80 were pCP34, pCP41 and CP44 (CPX cloning 
vectors). The atpAGD fragment from pTAC-AGD (from example 
1) was first subcloned in a polylinker in order to have 
the atpAGD fragment flanked by two BamHI sites. Subse- 

25 quently, this BamHI fragment was cloned into the unique 
BamHI site downstream of the CP promoters on the plasmids 
pCP34, pCP41 and CP44, resulting in the plasmids, 
pCP34 : : atpAGD, pCP34 : : 2 atpAGD, pCP41: : atpAGD and 
CP44 : tatpAGD, where pCP34 :: 2a tpAGD contains two atpAGD 

30 fragments in tandem. 

Subsequently, the strains were characterized with respect 
to growth rate, growth yield and glycolytic flux in glu- 
cose minimal medium supplemented with 200 ^g/ml erythro- 
35 mycin, essentially as described in example 1, see table 
2. 
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The expression of the Fi-ATPase subunits had a slightly 
negative effect on the growth rate as the expresion level 
increased. The effect on growth yield was much stronger 
and at the highest expression level the growth yield had 
5 dropped to 40 % of the initial value. The glycolytic flux 
was stimulated 70% at the highest expression level of AT- 
Pase, and at this expression level the growth rate was 
lowered by 30%* 

Table 2. Effect of expression of uncoupled Fi-ATPase activ- 



ity (E. coli 


a, y$ P 


subunits ) 


in E. coli 






Plasmid 


Biomass 
yield 

gdw/mmol 
glucose 


Growth 
rate , u 

h-1 


Glucose 
flux 

mmol glu- 
cose/h/gdw 


Biomass 
yield 

% 


Growth 
rate 


Glucose 
flux 


pCP41 


0,067 


0,47 


6,9 


100 


100 


100 


pCP41 : : atpAGD 


0,047 


0,42 


9,1 


69 


90 


131 


pCP34 


0,063 


0, 41 


6, 6 


100 


100 


100 


pCP34 : : atpAGD 


0,034 


0, 34 


9,9 


54 


81 


149 


pCP44 


0,067 


0,44 


6,5 


100 


100 


100 


pCP44 : : atpAGD 


0,027 


0,30 


11,2 


40 


69 


172 



EXAMPLE 3 

Expression of E. coli Fi-ATPase activity from constitu- 
5 tive promoters in L* lactis. 

The plasmids from example 2 which express the E. coli 
Fi-ATPase subunits to various extent are also capable of 
replicating in Z#. lactis, and could therefore be used to 
10 test whether the E. coli F^-ATPase subunits can be used 
to hydrolyse ATP in L. lactis. 
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The plasmids pCP34 : : atpAGD, pCP34 : : 2atpAGD and 
pCP4 1: zatpAGD, were transformed into the L. lactls sub- 
species cremoris strain, MG1363, which is used routinely 
for physiological experiments in this laboratory. In ad- 

5 dition we transformed the respective vectors, pCP34 and 
pCP41 in order to have proper control strains. Subse- 
quently, the resulting transformants were characterized 
with respect to growth rate, growth yield and glycolytic 
flux, in comparison to the respective vectors, pCP34 and 

10 pCP41, by growing the various cultures in defined medium 
(SA medium) supplemented with a limiting concentration of 
glucose (0.1%), see table 3. 



Table 3. Expression of E. coll Fi-ATPase in L. lactls 



Plasmid 


Biomass 
yield 

gdw/mmol 
glucose 


Growth 
rate, p 

h-1 


Glucose 
flux 

rranol glu- 
cose/h/gdw 


Biomass 
yield 

% 


Growth 
rate 


Glucose 
flux 


pCP34 




0,073 


0, 664 


9,161 


100 


100 


100 


pCP34 : 


: a tpAGD 


0, 071 


0, 653 


9,230 


97,5 


98,3 


100,8 


pCP34 : 


-.ZatpAGD 


0, 069 


0, 655 


9,560 


94, 6 


98, 7 


104, 4 


pCP41 




0,072 


0, 645 


8,925 


100 


100 


100 


pCP41: 


: a tpAGD 


0, 070 


0, 590 


8, 461 


96, 5 


91,5 


94, 8 



The results show that the plasmids pCP34 : : atpAGD and 
pCP34 s : 2atpAGD did affect the growth yield and the glyco- 
lytic flux to some extent, but the plasmids were far less 
5 efficient in L. lactls, compared to E* coll. This was 
probably a consequence of a lower expression of the £*. 
coll ATPase subunits, or some of these, in L. lactls, due 
to a lower copy number of the pAK80 vector in L. lactls 
(5-10), and due to differences in the translational 
10 effciency of the three individual atp genes which origi- 
nates from E. coli* The plasmid pCP41: xatpAGD also re- 
sulted in a lower growth yield, indicating that also in 
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this case uncoupled ATP hydrolysis was taking place. How- 
ever, the pCP4 1 : : atpAGD plasmid had a relatively strong 
inhibitory effect on the growth rate and therefore the 
glycolytic flux was not increased by this plasmid. It is 
5 possible that the heterologous expression of B, coli AT- 
Pase subunits resulted in growth inhibition due to ef- 
fects other than ATP hydrolysis, e.g. by interfering with 
the function of the L. lactis FiFq H + -ATPase complex. 

10 EXAMPLE 4 

Expression of L* lactis Fi-ATPase subunits P and in I>. 
lactis. 

15 In the example above we showed that the expression of Fj- 
ATPase subunits from E. coli in X*. lactis, resulted in 
only a small stimulation of the glycolytic flux. It is 
possible that the heterologous expression of E* coli AT- 
Pase subunits resulted in growth inhibition due to ef- 

20 fects other than ATP hydrolysis, e.g. by interfering with 
the function of the L. lactis FiFo H + -ATPase complex. In 
the present example we have expressed the L. lactis Fi- 
ATPase subunits, P and e, in L. lactis, as this appeared 
to be an effective combination of subunits when expressed 

25 in E. coli, see example 1* The atpDC^c genes from L. 
lactis subspecies cremoris (SEQ ID No. 1) was cloned on a 
2.5kb Hindlll fragment into the Hindlll restriction site 
on the standard cloning vector, pBluescript, into E. coli 
K-12, strain BOE270. Subsequently, the atpDC^ic genes 

30 were cut out on a 2.5kb BamHI-Sall fragment and cloned 
into 5 expression vectors, pCP32, pCP34, pCP37, pCP41 and 
pCP44 which had been digested with BamHI and Sail, re- 
sulting in the plasmids pCP32 : : atpDC^o * pCP34 : : atpDC^ic , 
pCP37: zatpDCitic, pCP41 : : atpDCLlc a™* pCP44 : : a tpDCj^ic , re- 

35 spectively, where the lacLM genes downstream of the CP 
promoters, have been replaced with the atpDC^iQ genes. 
These plasmids should express the L. lactis Fi-ATPase 
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subunits, 0 and e, to different extent. The plasmids were 
then transformed into MG1363 with selection for the 
erythromycin resistance carried by these vectors. Experi- 
ments were then performed to test whether the constructs 
resulted in convertion of ATP into ADP in L. lactis . The 
strains carrying the different constructs was then grown 
in GM17 medium supplemented with 5 ^ig/ml erythromycin. 
The plasmids did not have a strong effect on the growth 
rate of the cultures, which remained close to the growth 
rate of the respective vector control plasmids. The yield 
of biomass , however, decreases for all the cultures by up 
to 17%, which shows that the cultures did indeed express 
uncoupled ATPase activity, see table 4. 



Table 4. Effect of expression of L. lactis 0 and e 
subunits on acid production by L . lactis, at 30°C and 
with initial pH 6.7. 



Plasmid 


Biomass* 

OD450 


Final pH* 


Acid formation, 
relative to biomass* 

% of vector 


pCP34 


5. 08 


4.27 


100 


pCP34: zatpDCllc 


4.72 


4.31 


98.0 


pCP41 


4. 66 


4 .34 


100 


pCP41: latpDCllc 


5.21 


4.24 


113.5 


pCP37 


4.69 


4.28 


100 


pCP37: latpDCllc 


4.63 


4.24 


116.1 


pCP32 


4.86 


4 . 34 


100 


pCP32: latpDCllc 


3.95 


4.36 


116.9 



production was calculated from the pH change, and normalized by the 
biomass produced. 



The GM17 growth medium used in these experiments contains 
a surplus of glucose (1%), and growth only stops when the 
pH of the growth medium becomes lower than approximately 
pH 4.3. To some extent, this mimics the situation that 
the lactic acid bacteria experience during cheese and 
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yougurt production. In this medium, the growth yield, in 
terms of the final cell mass of the cultures, reflects 
the acid production by these cultures. 

5 In these cultures , the expression of Fi-ATPase subunits 
will increase three fold at approximately OD600 equal to 
1.5. This is a consequence of the three fold amplifica- 
tion of the plasmid copy number that has been shown to 
take place at this point of the growth curve. In reality, 
10 the effect of expressing the Fi-ATPase subunits may 
therefore be larger. 

To test this hypothesis, we grew some of the strains 
which expressed the L. lactis Fj-ATPase subunits p and e 
15 in batch cultures of GM17 medium which had been adjusted 
to pH 5.9, see Table 5. In addition, the temperature of 
the growth medium may also affect the plasmid copy number 
and thus the expression of the F t -ATPase subunits. The 
experiments were therefore performed at 37 °C. 

20 

Table 5. Effect of expression of L. lactis P and e 
subunits on acid production by L. iactis, at 37 °c and 
with initial pH 5.9. 



Plasmid 


Biomass" 

OD<i 5 o 


Ffinal pH" 


Acid formation, 
relative to bioraass* 

of vector 


pCP34 




1.24 


4.95 


100 


pCP34 


: :atpDC Uc 


1.06 


4.87 


141.4 


pCP37 




1.00 


4.96 


100 


pCP37 


: :atpDC Uc 


0.58 


4.92 


188.4 



25 

Clearly, the effect of the FT-ATPase activity was much 
stronger under these growth conditions: the amount of 
acid produced was almost doubled for the strain carrying 
the plasmid pCP37 :: a tpDC 1Ic . 



30 
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EXAMPLE 5 

Expression of the Fi-ATPase subunits, a, y, and 0, from 
X. lactls subspecies cremorls in L. lactls subspecies 
5 cremoris . 

In example 4, only the L. lactls Fj-ATPase p and e 
subunits were expressed in L. lactls. However , from the 
experiments with E. coll (example 1), we know that the 

10 simultaneous expression of subunits a, y, and £J, is a 
more powerful combination, which could also be the case 
for L. lactls. In order to obtain the same strong stimu- 
lation of the glycolytic flux and acid production in L. 
lactls, a set of vectors similar to the vectors described 

15 in example 4 was constructed, in which the atpAGD^ic 
genes derived from I#. lactls, encoding the subunits a, y, 
and 0 (SEQ ID No, 1) was expressed from CP promoters with 
different activities . The atpAGD^\ c genes from L. lactls 
was cloned on a 2.5 kb BamHI-Sall fragment into the 5 

20 vectors, pCP32, pCP34, pCP37, pCP41 and pCP44, resulting 
in the plasmids, pCP32 : : atpAGD^Xc * pCP34 : : a tpAGE>i>\ c , 
pCP37 s : atpAG£?Llc / pCP41 : : atpAGD^ic , pCP44 : : atpAGDj^ic , re- 
spectively, where the lacLM genes downstream of the CP 
promoters, has been replaced with the atpAGD^ic genes. 

25 These plasmids will express the L. lactls Fi-ATPase 
subunits a, y, and 3, to different extent. The plasmids 
were transformed into MG136 3 with selection for the 
Erythromycin resistance carried by these vectors. Experi- 
ments were then performed to show that the constructs 

30 were effective in ATP hydrolysis in L. lactls and to what 
extent the glycolytic flux was enhanced, by growing the 
five different constructs in GM17 medium supplemented 
with erythromycin, and measuring the growth rate, ATP and 
ADP concentrations, the yield of biomass and the rate of 
35 acid production. 
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EXAMPLE 6 

Expression of Fi-ATPase subunits from L. lactis subsp 
lactis, in L. lactls subspecies lactls. 

5 

In the examples 3-5 above, we used the strain L. lactls 
subsp. cremoris, MG1363. This strain is plasmid-f ree and 
is used routinely in our laboratory as a simple model or- 
ganism for our physiological studies . But strains belong- 

10 ing to the subspecies lactis are also important in cheese 
production. We therefore cloned and sequenced the at- 
pAGDi.ll genes from L. lactis subsp. lactis f (SEQ ID No. 
6). Subsequently, a 4.2 kb fragment habouring the at- 
pAGD^w genes was cloned into 5 vectors, pCP32, pCP34, 

15 pCP37, pCP41 and pCP44, resulting in the plasmids, 
pCP32: :atpAG£>Lii, pCP34 : : atpAGD^n , pCP37: latpAGD^n, 
pCP41: zatpAGD^n, pCP44 : : atpAGD^n , respectively. These 
plasmids were then transformed into L. lactis subsp. lac- 
tis as described in example 3 . The resulting strains with 

20 different expression levels of the Fi-ATPase subunits a, 
y and P were then used to characterize the effect hereof 
on the growth yield, growth rate, glycolytic flux, and 
the cellular energy state of L. lactis subsp. lactis, as 
described in the examples 1-5. 

25 

EXAMPLE 7 

Expression of Fl-ATPase subunits from S. thermophilus, 
ST3, in S. thermophilus, ST3 

30 

In the examples 3-6 above, we used strains of the genus 
Lactococcus. These strains are important in cheese pro- 
duction. As starter cultures for yougurt production, the 
dairy industry often uses strains of S. thermophilus. We 
35 therefore cloned and sequenced the atpAGDst genes from S. 
thermophilus, strain ST3 (SEQ ID No. 10). Subsequently, a 
4.2 kb fragment habouring the atpAGDst genes was cloned 
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into the 5 vectors, pCP32, pCP34, pCP37 , pCP41 and pCP44, 
resulting in the plasmids, pCP32 : zatpAGD$t t 

pCP34: zatpAGDstf pCP37: zatpAGD$t* pCP41: ;atpAGZ?st/ 

pCP44: zatpAGDstf respectively. These plasmids were then 
5 transformed into £. thermophilics strain ST3 . The result- 
ing strains have different expression levels of the Fx- 
ATPase subunits a, y f and P, and were then used to char- 
acterize the effect hereof on the growth yield, growth 
rate, glycolytic flux, and the cellular energy state of 
10 S. thermophil us , as described in the previous examples. 

EXAMPLE 8 

Expression of a truncated Fi-ATPase P subunit from Phaf- 
15 fia rhodozyma in Saccharomyces cerevisiae 

In this example we show that uncoupled Fi-ATPase expres- 
sion can also be used to hydrolyze ATP in yeast cells of 
Saccharomyces cerevisiae . 

20 

A cDNA gene library was prepared from total RNA, isolated 
from Phaffia rhodozyma, by cloning the cDNA fragments 
into the expression vector, pYES2.0. One of the resulting 
plasmids, pATPbeta, gave rise to an ade + phenotype in the 
25 Saccharomyces cerevisiae strain, W301 , which carries a 
mutation in the ADE2 gene. Sequencing of the clone re- 
vealed a 0.9 kb insert, which encoded a protein of 254 
amino acids (SEQ ID No. 14). The encoded protein had a 
very high homology to the C- terminal part of Fx-ATPase p 
subunits from other organisms, prokaryotic as well as eu- 
karyotic, including the P subunit from S. cerevisiae (86% 
identity) . 

The ADE2 mutation results in starvation for an intermedi- 
ate further down in the purine metabolism, AICAR (which 
under normal conditions is produced by ADE3 , two steps 
further down in this pathway) . AICAR is essential for de 
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novo biosynthesis of AMP and GMP, and KDE2 mutants there- 
fore need an alternative purine source in the growth me- 
dium. However , there is an alternative route for synthe- 
sis of AICAR which involves some of the genes involved in 

5 histidine biosynthesis. These genes are normally re- 
pressed under the conditions used for the complementa- 
tion/ but when the HIS3 gene is introduced on a plasmid, 
this complements the ADE2 mutation because the cells 
start to produce AICAR. Since AICAR is a precursor for 

10 ATP, it is likely that a lack of ATP (or increased levels 
of ADP and AMP) provide a signal to derepress the HIS3 
gene and generate AICAR (which will subsequent] y end up 
as ATP) . Indeed/ cross-pathway regulation between purine 
and histidine biosynthesis has been found in yeast and 

15 involves the transcription factors BAS1 and BAS2 . A rea- 
sonable explanation for the ade* phenotype conferred by 
the plasmid/ is therefore that the plasmid gives rise to 
ATP hydrolysis in the cytoplasm, thereby effecting the 
concentrations of adenine nucleotides in the cytoplasm. 

20 

Importantly, this truncated P subunit from Phaffla rho- 
dozyma that was encoded on pATPbeta, included the region 
of the P subunit which is thought to encode the catalytic 
site for ATP hydrolysis. The truncation of the N-terminal 
25 part of the p subunit probably means that the protein 
will no longer be exported into the mitochondrion, but 
should stay within the yeast cytoplasm. 

The truncated p subunit pATPbeta is expressed from a gal 
30 promoter, i.e. it can be induced with galactose. If the 
truncated p subunit encoded by the clone is active in ATP 
hydrolysis it should result in a decrease in the growth 
yield, and at sufficiently high expression level, we 
should also observe inhibition of growth. The strain 
35 which expressed the truncated P subunit and a control 
strain (which contained a plasmid pHIS3 containing a HIS3 
gene from Phaffia rhodozyma) , were streaked on plates 
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containing galactose as the energy source, which will 
give maximal expression of the truncated p subunit. In- 
deed, the growth of the strain which expressed the trun- 
cated p subunit was strongly inhibited by the presence of 
5 galactose, whereas the control strain grew normally. As a 
control, the growth of the two strains were also compared 
on a plate containing glucose as the energy source, con- 
ditions under which the expression of the p subunit 
should be repressed, and indeed we observed little dif- 
10 ference in growth of the two strains on these plates, see 
table 6 . 

Subsequently, for the purpose of the physiological inves- 
tigations, the two strains were converted into Rho~ 

15 strains (petit mutants, defective in oxidative phosphory- 
lation) by standard treatment with ethidium bromide. The 
induction with galactose caused even stronger inhibition 
of growth in the Rho~ background, which further indicates 
that the cause of the growth inhibition is uncoupled ATP 

20 hydrolysis in the cytoplasm. 

Table 6. Effect of expression of a truncated Fi-ATPase P 
subunit from Phaffla rhodozyma in S. cerevlsiae on SC 
plates 

25 



Strain/plasmid 


SC-ura + glucose 


SC-ura + galactose 


Rho + /pATPbeta 


+++++ 


+ 


RhoVpHIS3 


++++ 


+++ 


Rho~/pATPbeta 


+++++ 




Rho"/pHIS3 


++++ 


+++ 



Growth experiments were performed to measure the result- 
ing changes in the ATP/ADP ratio and the degree of stimu- 
lation of the glycolytic flux and ethanol formation, es- 
30 sentially as described in the examples above, and to show 
that the truncated p subunit from Phaffia rhodozyma is 
active with respect to converting ATP into ADP in the 
yeast cell. 
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EXAMPLE 9. 

Expression of Fi-ATPase P subunit from Tri ch odLerma. reesei 
in Saccharomyces cerevlslas. 

5 

In this example we show that the expression of the Fi- 
ATPase P subunit from the filamentous fungus, Trichoderma 
reesei can be used to improve the product formation of 
Sa ccharomyces cerevi siae, 

10 

The gene encoding the FrATPase P subunit homologue from 
Trlchoderma reesei was isolated from a cDNA library, in- 
serted into a multicopy expression vector, pAJ401. DNA 
sequencing (SEQ ID 16) revealed that the cloned gene had 

15 very high homology to the p subunit s from Neurospora 
crassa (91% identity), Kluyveromyces lactis (68%) and 
Saccharomyces cerevisiae (68%) . Importantly, the first 43 
amino acids in this P subunit, which encodes the signal 
for exporting the protein into the mitochondria, was ho- 

20 mologous to the N-terminal part of the p subunit from 
Neurospora crassa (58% identity), but not to that of Sac- 
charomyces cerevisiae. It is therefore likely that the P 
subunit from Trichoderma reesei will stay within the cy- 
toplasm when expressed in Saccharomyces cerevisiae . This 

25 is important for the many cases where the fermentation is 
carried out anaerobically, because in these cases it is 
probably most efficient if the ATP hydrolysis takes place 
in the cytoplasm. Alternatively, in those cases where the 
P subunit is transported into the mitochondrion, it may 

30 be useful to genetically modify the P subunit so that is 
stays within the cytoplasm. 

The gene encoding the Fj-ATPase P subunit homologue from 
Trichoderma reesei was expressed in S. cerevisiae strain 
35 VWlb (MAT alpha, leu2-3/112 t ura3-52, trpl-289, his3Dl, 
MAL2-8c, SUC 2) . To test whether the presence of the T. 
reesei P subunit resulted in ATP hydrolysis in the cyto- 
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plasm of the Saccharomyces cerevisiae host cells, we 
measured the intracellular concentrations of ATP, ADP and 
AMP, under various growth conditions in cultures of two 
strains expressing the p subunit (pATPP34 and pATPP44) 
5 and a strain carrying the vector plasmid, pFL60, see ta- 
ble 7. 

Table 7. Effect of expression of T. reesei p subunit on 
ATP, ADP and AMP concentrations* in S. cerevisiae 
it) 



Strain 


ATP ADP AMP 
Vimol/gdw ymol/gdw jjmol/gdw 


ATP/ADP ratio 


Aerobl c/exp . phas e 
pATPp34 
pATPP4 4 
pVECTOR 


19.3 5.58 3.31 
13.9 5.15 3.25 
16.6 5.47 3.43 


3.5 
2.7 
3.0 


Aerobic/stat .phase 
PATPP34 
PATPP4 4 
pVECTOR 


9.30 4.03 2.89 
8.99 3.90 2.42 
19.5 4.62 2.87 


2.3 
2.3 
4.2 


anaerobic/sta t .phase 
pATPP34 
PATPP44 
pVECTOR 


4.39 11.6 6.72 
3.14 10.5 6.65 
8.84 10.2 6.37 


0 . 4 
0.3 
0.9 



according to Bergmeyer (1985) 



The p subunit did not appear to have a significant effect 
on the concentrations of ATP, ADP and AMP in cells grow- 

15 ing on glucose in the exponential growth phase. The rea- 
son is probably that the ATP concentration that the ho- 
meostatic control of ATP synthesis can here keep up with 
the extra drain on ATP conferred by the P subunit F,- 
ATPase activity. Indeed, the growth rate of these cul- 

20 tures was unaffected by the presence of the Fi-ATPase ac- 
tivity, see table 7. But in the stationary cultures the 
concentration of ATP decreased significantly in the cul- 
tures expressing the P subunit/ compared to the control. 
The effect was strongest in the anaerobically grown cul- 

25 tures where the ATP was lowered by a factor of 2-3 . In 
these cultures, ATP must be generated through oxidative 
phosphorylation, (which is not even an option for the an- 
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aerobic cultures), and any effect of uncoupled ATP hy- 
drolysis should therefore indeed be stronger in these 
cells . 

5 Shake flask cultivations of cultures expressing the Fi- 
ATPase p subunit homologue in Saccharomyces cerevisiae . 

Shake flask cultivations were performed under microaero- 
bic/anaerobic conditions with volume ratio 1/1.25 and no 

10 agitation; with 400 ml growth media in 500 ml Erlenroeyers 
on magnetic stirring. The growth media contained 5 g/1 of 
glucose and amino acids and bases according to synthetic 
complete medium (SC-ura+0 . 5%G) . OD fil - HJ was monitored during 
the cultivation (OD600=1.0 is equal to 0.3 g/1 dry 

15 weight) . Ethanol and glucose were measured with HPLC 
(Waters, Sugar-Pak or IC-Pak columns) . Production of 
ethanol (grams of ethanol per grams of cell dry weight) 
is shown in Table 8. 

20 Table 8. Effect of expression of T. reesei P subunit, 
on fluxes of ethanol and glucose in s. cerevisiae 



Strain 






J«toh 








h" 1 


g/h/gdw 


g/h/gdw 


relative 
to con- 
trol 


relative to 
control 


pATPp34 


0.40 


2.811 


1.190 


107 .7 


105.6 


pATPj*4 4 


0.40 


2.750 


1.167 


105.3 


105 .3 


pVECTOR 
control 


0.39 


2.611 


1.127 


100 


100 



These data show that the presence of the T. reesei F)- 
25 ATPase 3 subunit resulted in an increased flux of glu- 
cose, as well as ethanol, in the Saccharomyces cerevisiae 
host cells. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Peter Ruhdal JENSEN 

(B) STREET: Soegaardsvej 19 

(C) CITY: Gentofte 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP): DK-2820 

(ii) TITLE OF INVENTION: A method of improving the production of 
biomass or a desired product from a cell 

{iii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(CJ OPERATING SYSTEM: PC- DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 963/96 
{B} FILING DATE: 06-SEP-1996 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



( iii ) HYPOTHETICAL : NO 
(iv) ANTI-SENSE : NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis subsp. cremoris 

(B) STRAIN: MG1363 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 
(Bi LOCATION: 26. .550 

(D) OTHER INFORMATION : /codon_start= 26 
/product^ "ATPase subunit" 
/gene= "atpH" 

/standard_name= "delta subunit of the Fl portion 
of the F0F1 ATPase" 
/label- delta-subunit 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 742. .2241 

(D) OTHER INFORMATION: /codon_start= 742 
/product^ "ATPase subunit" 
/gene= "atpA" 

/standard name= "alpha subunit of the Fl portion 
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of the F0F1 ATPase" 
/label= alpha-subunit 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 22 60. .3126 

(D) OTHER INFORMATION :/codon_start= 2260 
/product= "ATPase subunit" 
/gene= "atpG" 

/standard_name= "gamma subunit of the Fl portion 
of the F0F1 ATPase" 
/label= gamma- subunit 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(8) LOCATION: 3301. .4707 

(D) OTHER INFORMATION :/codon_s tart- 3301 
/product= "ATPase subunit" 

/gene= "atpD" ^ ^ . _ 

/standard_name= "beta subunit of the Fl portion of 

the F0F1 ATPase" 
/label= beta-subunit 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TATCTCGCTA AGTTAGGAGA ATAAG ATG ACA AAA GTA AAT TCA CAA AAA TAC 
iA Met Thr Lys Val Asn Ser Gin Lys Tyr 

1 5 

ACT AAA GCT TTA CTT GAG GTC GCC CGA GAA AAA GGA CAA CTT GAA GCA 
Ser Lys Ala Leu Leu Glu Val Ala Arg Glu Lys Gly Gin Leu Glu Ala 
10 15 20 " 

ATT CTT ACT GAA GTT AGC GAA ATG ATT CAG CTT TTC AAA GAA AAT AAC 
lie Leu Thr Glu Val Ser Glu Met He Gin Leu Phe Lys Glu Asn Asn 
30 35 

TTA GGT GCT TTT TTA GCA AAT GAA GTT TAT TCA TTC TCT GCT AAA TCT 
Leu Gly Ala Phe Leu Ala Asn Glu Val Tyr Ser Phe Ser Ala Lys Ser 
45 50 55 

GAA TTG ATT GAT ACT TTG CTT CAA ACT TCA TCA GAA GTG ATG TCA AAT 244 
X£ leu lie Asp Thr Leu Leu Gin Thr Ser Ser Glu Val Met Ser Asn 
60 65 70 

TTC CTG AAT ACT ATT CGT TCT AAT GGA CGT CTA GCT GAC CTC GGA GAA 292 
Phe Leu Asn Thr He Arg Ser Asn Gly Arg Leu Ala Asp Leu Gly Glu 
75 80 85 

ATA CTT GAA GAA ACT AAA AAT GCA GCA GAT GAC ATG TTC AAA ATT GCT 340 
He Leu Glu Glu Thr Lys Asn Ala Ala Asp Asp Met Phe Lys He Ala 

95 100 1°3 



90 



52 



100 



148 



196 



388 



GAC GTT GAA GTT GTT TCA AGT ATT GCA TTG TCA GAA GCT CAA ATT GAA 
Aso Val Glu Val Val Ser Ser lie Ala Leu Ser Glu Ala Gin He Glu 
P HO H5 120 

AAA TTT AAA GCA ATG GCT AAA TCA AAA TTT GAT TTA AAC GAA GTA ACA 4 36 

Lvs Phe Lys Ala Met Ala Lys Ser Lys Phe Asp Leu Asn Glu Val Thr 
125 130 135 
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GTA ATT AAT ACA GTC AAT GAA AAA ATT CTC GGA GGA TTC ATT GTG AAC 484 
Val He Asn Thr Val Asn Glu Lys He Leu Gly Gly Phe He Val Asn 
140 145 ISO 

TCT CGT GGA AAA ATT ATT GAC GCC TCA TTA AAA ACA CAA TTG GCT AAA 532 
Ser Arg Gly Lys He He Asp Ala Ser Leu Lys Thr Gin Leu Ala Lys 
155 160 165 

ATC GCC GCT GAA ATC CTC TAATCAGGAT AGAAAAATTT TCTTCCTTTG 580 
He Ala Ala Glu He Leu 
170 175 

TTAAAAACTT AGT GGAGAAT TTTTCAAACT CAAACTGTTA AACTTTTGAA AAC AT GC AAA 64 0 

GGTAATTTTA AAACTTGCTT ATT CAT GCT C AAAAAGTATA ACTGCAGTTT AAAGCTAAAT 7 00 

AGCCTTGAAC TAGTAAAAAA TTTCTAGAAG G GAG CAT ATT T TTG GGA ATT AAA 7 53 

Leu Ala He Lys 
1 

GCT AAT GAA ATC AGC TCA CTG ATT AAA AAA CAA ATT GAA AAT TTC ACA 801 
Ala Asn Glu He Ser Ser Leu He Lys Lys Gin He Glu Asn Phe Thr 
5 10 15 20 

CCA GAT TTT GAA GTT GCT GAA ACT GGT GTC GTT ACC TAT GTT GGT GAT 84 9 

Pro Asp Phe Glu Val Ala Glu Thr Gly Val Val Thr Tyr Val Gly Asp 
2S 30 35 

GGT ATC GCG CGT GCC TAT GGC CTT GAA AAT GCG ATG AGC GGT GAG CTT 8 97 

Gly He Ala Arg Ala Tyr Gly Leu Glu Asn Ala Met Ser Gly Giu Leu 
40 45 50 

GTT GAG TTT TCA AAT GGT ATA CTT GGT ATG GCG CAA AAC TTG GAT GCT 945 
Val Glu Phe Ser Asn Gly He Leu Gly Met Ala Gin Asn Leu Asp Ala 
55 60 65 

ACA GAC GTT GGT ATT ATC GTA CTT GGT GAT TTC CTC TCA ATT CGT GAA 993 
Thr Asp Val Gly He He Val Leu Gly Asp Phe Leu Ser He Arg Glu 
70 75 80 

GGT GAC ACT GTT AAA CGT ACA GGT AAA ATC ATG GAA ATC CAA GTT GGT 1041 
Glv Asp Thr Val Lys Arg Thr Gly Lys He Met Glu He Gin Val Gly 
85 90 95 100 

GAA GAA CTC ATC GGA CGT GTT GTA AAC CCA CTT GGA CAA CCC GTT GAT 1089 
Glu Glu Leu He Gly Arg Val Val Asn Pro Leu Gly Gin Pro Val Asp 
105 HO H5 

GGA CTT GGA GAA CTT AAT ACA GGT AAA ACT CGT CCA GTT GAA GGA AAA 1137 
Gly Leu Gly Glu Leu Asn Thr Gly Lys Thr Arg Pro Val Glu Ala Lys 
120 125 130 

GCT CCT GGT GTT ATG CAA CGT AAA TCA GTC TCT GAG CCA TTA CAA ACT 1185 
Ala Pro Gly Val Met Gin Arg Lys Ser Val Ser Glu Pro Leu Gin Thr 
135 140 145 

GGT CTT AAA GCG ATT GAT GCC CTC GTT CCA ATT GGA CGT GGA CAA CGT 1233 
Gly Leu Lys Ala He Asp Ala Leu Val Pro He Gly Arg Gly Gin Arg 
150 155 160 
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GAA TTA ATT ATC GGA GAC CGT CAA ACT GGT AAA ACT TCA GTC GCT ATT 12 81 

Glu Leu lie lie Gly Asp Arg Gin Thr Gly Lys Thr Ser Val Ala lie 



165 



170 175 180 



GAT GCA ATC TTG AAC CAA AAA GGT CAA GAT ATG ATT TGT ATC TAT CTT 1329 
Asp Ala He Leu Asn Gin Lys Gly Gin Asp Met He Cys He Tyr Val 
185 190 19b 

GCG ATT GGA CAA AAA GAG TCA ACA GTT CGT ACA CAA GTT GAA ACG CTC 1377 
Ala lie Gly Gin Lys Glu Ser Thr Val Arg Thr Gin Val Glu Thr Leu 
200 205 210 

CGT AAA CTC GGT GCG ATG GAT TAT ACA ATC GTC GTA ACT GCG TCA GCT 1425 
Arg Lys Leu Gly Ala Met Asp Tyr Thr He Val Val Thr Ala Ser Ala 
215 220 225 

TfT CAA CCT TCT CCA CTC CTT TAC ATC GCT CCT TAC GCT GGA GCT GCA 1473 
Ser Gin Pro Ser Pro Leu Leu Tyr He Ala Pro Tyr Ala Gly Ala Ala 
230 235 240 

ATG GGT GAA GAA TTT ATG TAT AAC GGT AAA CAT GTC TTG GTT GTT TAT 1521 
Met Gly Glu Glu Phe Met Tyr Asn Gly Lys His Val Leu Val Val Tyr 
245 250 255 260 

GAT GAT TTA TCT AAA CAA GCG GTC GCT TAC CGT GAA CTT TCT CTC TTG 1569 
Asp Asp Leu Ser Lys Gin Ala Val Ala Tyr Arg Glu Leu Ser Leu Leu 

CTC CGT CGT CCA CCA GGT CGT GAA GCA TAC CCA GGT GAC GTT TTC TAC 1617 
Leu Arg Arg Pro Pro Gly Arg Glu Ala Tyr Pro Gly Asp Val Phe Tyr 
280 285 290 

TTG CAC TCA CGT CTT TTG GAA CGT GCT GCT AAA TTG TCT GAT GAT CTT 1665 
Leu His Ser Arg Leu Leu Glu Arg Ala Ala Lys Leu Ser Asp Asp Leu 
295 300 305 

GGT GGT GGA TCA ATG ACG GCT TTG CCA TTC ATT GAA ACA CAA GCA GGT 1713 
Glv Glv Gly Ser Met Thr Ala Leu Pro Phe He Glu Thr Gin Ala Gly 
310 315 320 

GAT ATC TCA GCT TAT ATT CCA ACA AAC GTT ATC TCT ATT ACC GAC GGT 1761 
Asp He Ser Ala Tyr lie Pro Thr Asn Val He Ser He Thr Asp Gly 

CAA ATT TTC CTT GAA AAT GAC TTG TTC TAT TCA GGT GTA CGT CCT GCC 1809 
Gin He Phe Leu Glu Asn Asp Leu Phe Tyr Ser Gly Val Arg Pro Ala 
345 350 355 

ATT GAT GCT GGT TCA TCA GTA TCA CGT GTT GGT GGT GCC GCA CAA ATC 1857 
He Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly Ala Ala Gin He 
360 365 370 

AAA GCC ATG AAG AAA GTA GCT GGT ACT TTG CGT CTT GAC CTT GCG TCG 1905 
Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu Asp Leu Ala Ser 
375 380 385 

TTC CGT GAA CTT GAA GCC TTT ACA CAA TTT GGT TCT GAC CTT GAT GAA 1953 
Phe Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser Asp Leu Asp Glu 
390 395 400 

GCG ACT CAA GCA AAA TTG AAT CGT GGT CGT CGT ACC GTT GAA GTC TTG 2001 
Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr Val Glu Val Leu 
405 410 415 420 



WO 98/10089 



38 



PCT/DK97/00373 



AAA CAA CCA TTG CAC AAA CCA TTG GCT GH GM AAA CM GTT TTG ATT 
I£ SI Pro Leu His Lys Pro Leu Ala Val Glu Lys Gin VI Leu lie 



425 



CTC TAT GCA TTG ACT CAT GGT CAT CTT GAT AAT GTT CCA GTT GAT GAT 
SS t£ Ala Leu Thr His Gly His Leu Asp Asn Val Pro Val Asp Asp 

440 445 

GTT CTT GAT TTT GAA ACT AAA ATG TTC GAT TTC TTC GAT GCA AAT TAT 
Val Su Sp Phe Glu Thr Lys Met Phe Asp Phe Phe Asp Ala Asn Tyr 
455 460 4 

GCA GAT CTC TTG AAC GTA ATT ACT GAC ACT AAA GAT TTG CCA GAA GAA 
Sa Asp Leu Leu Asn Val He Thr Asp Thr Lys Asp Leu Pro Glu Glu 



470 



475 



GCA AAA CTT GAC GAA GCA ATT AAA GCA TTC AAA AAT ACA ACG AAT TAT 
Ala Lys Leu Asp Glu Ala lie Lys Ala Phe Lys Asn Thr Thr Asn Tyr 

490 433 



485 



TAATAAGGAG GCTAACTA ATG GGA GCT TCA CTT AAC GAA ATA AAA ACT AAG 
TAATAA Me t Gly Ala Ser Leu Asn Glu He Lys Thr Lys 



1 



5 10 



2049 



2097 



2145 



2193 



2241 



2292 



2340 



2388 



2436 



ATT GCG TCA ACA AAG AAA ACA ACT CAA ATC ACA GGT GCC ATG CAA ATG 
lie Ala Ser Thr Lys Lys Thr Ser Gin He Thr Gly Ala Met Gin Met 
15 20 2b 

CTT TCT GCT GCT AAA CTT CAA AAA GCA GAA TCT CAC GCT AAA GCT TTT 
Val Ser £5 SI Lys Leu Gin Lys Ala Glu Ser His Ala Lys Ala Phe 
30 35 q 

CAG ACT TAT GCT GAA AAA GTA CGT AAG ATT ACG ACT GAC TTA GTT TCA 
Sn ?nl Tyr Ala Glu Lys Val Arg Lys He Thr Thr Asp Leu Val Ser 
45 SO " 

AGC GAT AAT GAG CCG GCC AAA AAT CCG ATG ATG ATT AAA CGT GAA CTC 2484 
5er S Asn Glu Pro Ala Lys Asn Pro Met Met He Lys Arg Glu Val 
60 65 70 

AAG AAA ACT GGC TAT CTC GTT ATC ACA TCA GAT CGT GGG CTT GTT GGC 
£s L^ ?S Gly Tyr Leu Val He Thr Ser Asp Arg Gly Leu Val Gly 
80 85 

ACT TAT AAT TCA AAT ATT TTG AAG TCT GTT ATA ACT AAT ATA CGT AAA 
stl Tyr Asn Ser Asn He Leu Lys Ser Val He Ser Asn lie Arg Lys 

95 100 

CGC CAC ACA AAT GAG ACT GAG TAT ACA ATA CTT GCC CTT GGT GGT ACG 2628 
Arg His Thr Asn Glu Ser Glu Tyr Thr He Leu Ala Leu Gly Gly Thr 
110 115 12 

rGA GCG GAC TTT TTC AAA GCC CGT AAC GTC AAA GTT TCT TAT GTT CTT 2676 
SJ £a Sp IFe Phe Lys Ala Arg Asn Val Lys Val Ser Tyr Val Leu 
125 130 135 

CGC GGA CTT TCA GAT CAA CCG ACC TTT GAA GAG GTT CGG GCA ATT GTT 
Arg Gly Leu Ser Asp Gin Pro Thr Phe Glu Glu Val Arg Ala lie Val 
140 1" 150 155 



2532 



2580 



2724 



ACA GAA GCC GTA GAA GAA TAT CAA GCA GAA GAA TTC GAT GAA CTC TAT 2772 
Thr Glu Ala Val Glu Glu Tyr Gin Ala Glu Glu Phe Asp Glu Leu Tyr 
160 165 1/0 
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GTT TGT TAC AAC CAC CAT GTG AAC TCA TTG GTA AGT GAG GCA CGG ATG 2820 
Val Cys Tyr Asn His His Val Asn Ser Leu Val Sex Glu Ala Arg Met 
Y 175 180 185 

GAA AAA ATG TTA CCT ATT TCT TTT GAT GAA AAA GGT GAC GAA AAA GCA 2868 
Glu Lys Met Leu Pro lie Ser Phe Asp Glu Lys Gly Asp Glu Lys Ala 
190 195 200 

TCT CTT GTT ACA TTT GAA TTA GAA CCA GAT CGT GAA ACA ATC TTA AAT 2916 
Ser Leu Val Thr Phe Glu Leu Glu Pro Asp Arg Glu Thr lie Leu Asn 
205 210 215 

CAG TTG TTG CCG CAA TAT GCT GAA AGT ATG ATT TAT GGC TCA ATT GTT 2964 
Gin Leu Leu Pro Gin Tyr Ala Glu Ser Met lie Tyr Gly Ser He Val 
220 225 230 235 

GAT GCA AAA ACA GCA GAA CAT GCT GCA GGT ATG ACC GCA ATG CGT ACT 3012 
Asp Ala Lys Thr Ala Glu His Ala Ala Gly Met Thr Ala Met Arg Thr 
240 245 250 

GCA ACA GAT AAT GCA CAT TCT GTC ATT AAT GAT TTA ACC ATT CAA TAT 3060 
Ala Thr Asp Asn Ala His Ser Val He Asn Asp Leu Thr lie Gin Tyr 
255 260 265 

AAC CGT GCT CGT CAA GCT TCA ATT ACG CAA GAA ATT ACG GAA ATT GTT 3108 
Asn Arg Ala Arg Gin Ala Ser He Thr Gin Glu He Thr Glu He Val 
270 275 280 

GCG GGT GCT TCA GCG CTA TAATTACTGT CAAACATTAT TCTCAATGTT 3156 
Ala Gly Ala Ser Ala Leu 
285 

ACGATTTATC AACTTGAGGA ATAAATGTTC TGTCAGTAAA GGCTTTGAAT TTTAAATACG 3216 

TTTGTCAGTA AATTTTTACT GATTAGCTTA AAAATGAATA GAAATTCTGT TGTTAGACAG 3276 

AAAATAAAAA CAGGAGGAAA AACA TTG AGT TCT GGT AAA ATT ACT CAG GTT 3327 

Leu Ser Ser Gly Lys He Thr Gin Val 
1 5 

ATC GGT CCC GTC GTT GAC GTG GAA TTT GGT TCT GAT GCC AAA CTG CCT 3375 
He Gly Pro Val Val Asp Val Glu Phe Gly Ser Asp Ala Lys Leu Pro 
10 15 20 25 

GAG ATT AAC AAT GCC TTG ATT GTC TAC AAA GAT GTC AAT GGT TTA AAA 3423 
Glu He Asn Asn Ala Leu He Val Tyr Lys Asp Val Asn Gly Leu Lys 
30 35 40 

ACA AAA ATT ACT CTT GAA GTT GCT TTG GAA CTT GGT GAT GGT GCA GTT 3471 
Thr Lvs He Thr Leu Glu Val Ala Leu Glu Leu Gly Asp Gly Ala Val 
45 50 55 

CGT ACG ATC GCT ATG GAA TCT ACT GAT GGA TTG ACT CGT GGA CTT GAA 3519 
Ara Thr He Ala Met Glu Ser Thr Asp Gly Leu Thr Arg Gly Leu Glu 
60 65 70 

GTC CTT GAT ACA GGT AAA GCG GTC AGC GTT CCT GTT GGT GAA TCT ACT 3567 
Val Leu Asp Thr Gly Lys Ala Val Ser Val Pro Val Gly Glu Ser Thr 
75 80 85 

CTT GGT CGT GTT TTT AAT GTC CTT GGT GAC GTT ATT GAT GGT GGA GAA 3615 
Leu Glv Arg Val Phe Asn Val Leu Gly Asp Val He Asp Gly Gly Glu 
90 95 100 105 
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GAT TTC CCT GCT GAT GCA GAA CGT AAT CCT ATC CAC AAG AAA GCT CCA 
Zl He Pro Sa Asp Ala Glu Arg Asn Pro lie His Lys Lys Ala Pro 



110 



ACT TTT GAC GAA TTG TCA ACT GCA AAT GAA GTT CTT GTA ACA GGG ATT 
JS S £u Leu ser Thr Ala Asn Glu Val Leu Val Thr Gly He 

125 130 ■ LJ 

AAA GTT GTC GAT TTA CTT GCC CCT TAT CTT AAA GGT GGG AAA GTC GGA 
™ 55 S5 Sp leu Leu Ala Pro Tyr Leu Lys Gly Gly Lys Val Gly 
140 145 13U 

CTC TTC GGT GGT GCC GGT GTT GGT AAA ACC GTC CTT ATC CAA GAA TTG 
2S £e XJ Gly Ala Gly Val Gly Lys Thr Val Leu lie Gin Glu Leu 
155 J- 60 165 

ATT CAC AAT ATT GCC CAA GAA CAC GGT GGT ATT TCT GTA TTT ACA GGT 
S ^ ?Ie Sa Gin Glu His Gly Gly lie Ser Val Phe Thr Gly 

170 I 75 180 

GTT GGC GAT CGT ACT CGT GAC GGG AAT GAC CTT TAC TGG GAA ATG AAA 
511 Sp lf g Thr Arg Asp Gly Asn Asp Leu Tyr Tr P Glu Met Lys 



190 



2B5 



3663 



3711 



3759 



3807 



3855 



3903 



3951 



rAA TCA GGC GTT ATT GAA AAA ACA GCC ATG GTC TTT GGT CAA ATG AAT 
gJS ITr 25 SS tie Glu Lys Thr Ala Met Val Phe Gly Gin Met Asn 
205 210 ^ 

GAA CCA CCT GGA GCA CGT ATG CGT GTT GCC CTT ACT GGT TTA ACA ATT 
21 Si Pro Gly Ala Arg Met Arg Val Ala Leu Thr Gly Leu Thr lie 
220 225 230 

GCG GAA TAT TTC CGT GAT GTT CAA GGA CAA GAC GTA TTG CTT TTC ATC 
Sa SJ Tyr Phe Arg Asp Val Gin Gly Gin Asp Val Leu Leu Phe He 
235 240 z " 

GAT AAC ATC TTC CGT TTC ACT CAA GCT GGT TCA GAA GTT TCT GCC CTT 
£p £n ne Phe Arg Phe Thr Gin Ala Gly Ser Glu Val Ser Ala Leu 
250 255 260 2" 

Trr rGA CGT ATG CCT TCT GCC GTT GGT TAC CAA CCA ACT CTT GCA ACT 414 3 

?rp Ty Axg ill SS LI Ala Val Gly Tyr Gin Pro Thr Leu Ala Thr 
270 275 " u 

GAA ATG GTT CAA TTA CAG GAA CGT ATC ACT TCT ACT AAG AAG GGT TCT 
£1 Het 5al Gin Leu Gin Glu Arg He Thr Ser Thr Lys Lys Gly Ser 

290 29i 



3999 



4047 



4095 



4191 



4239 



GTT ACA TCT ATC CCA GCG ATT TAT GTC CCT GCC GAT GAC TAT ACT GAC 
5al ?S Ser lie Pro Ala He Tyr Val Pro Ala Asp Asp Tyr Thr Asp 
300 305 310 

CCA GCG CCA GCT ACA GCC TTC GCT CAC TTG GAC GCA ACA ACT AAC TTG 
P™ Ala Pro Ala Thr Ala Phe Ala His Leu Asp Ala Thr Thr Asn Leu 
315 320 325 

GAA CGT CGT TTG ACA CAA ATG GGT ATC TAT CCA GCC GTT GAC CCA CTT 
Glu £1 Arg Leu Thr Gin Met Gly lie Tyr Pro Ala Val Asp Pro Leu 
33O 335 340 345 

GCT TCA TCA TCA CGT GCG CTT ACA CCT GAA ATT GTT GGT GAA GAA CAC 4 383 

SI ser Ser Ser Arg Ala Leu Thr Pro Glu He Val Gly Glu Glu His 
350 355 360 



4287 



4335 
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TAT GAA GTT GCA ATG GAA GTT CAA CGT GTC CTT CAA CGC TAG AAA GAA 4431 
™r Si Val Ala Met Glu Val Gin Arg Val Leu Gin Arg Tyr Lys Glu 



365 



4479 



4527 



4671 

4717 

4777 
4815 



TTG CAA GAT ATC ATT GCC ATT CTT GGT ATG GAT GAA TTG TCA GAT GAT 
Su Gin Asp lie lie Ala He Leu Gly Met Asp Glu Leu Ser Asp Asp 
380 385 39° 

GAA AAA ATT CTC GTT GGA CGT GCA CGT CGT ATC CAA TTC TTC CTT TCA 
Si Jys lie Leu Val Gly Arg Ala Arg Arg He Gin Phe Phe Leu Ser 
395 400 405 

CAA AAC TTC CAC GTT GCT GAA CAG TTT ACT GGT CAA CCT GGT TCA TAT 4575 
£S Jin Phe His Val Ala Glu Gin Phe Thr Gly Gin Pro Gly Ser Tyr 
410 415 420 

GTA CCA ATT GAC AAA ACA GTT CAT GAC TTC AAG GAA ATT TTG GAA GGT 4 623 

5al Pro lie Asp Lys Thr Val His Asp Phe Lys Glu He Leu Glu Gly 
430 435 4 

AAA TAT GAC GAA GTC CCT GAA GAT GCT TTC CGT GGA GTA GGT CCA ATT 
JJi Tyr Asp Glu Val Pro Glu Asp Ala Phe Arg Gly Val Gly Pro lie 
y y 445 450 455 

GAA GAC GTA CTT GCA AAA GCA AAA TCA ATG GGT TAT TAATTCGATT 
Glu Asp Val Leu Ala Lys Ala Lys Ser Met Gly Tyr 
460 465 

TCTTATGAAA TGACAAAGTG AAAATACATT ATTGAATCGC AAAATTTACT GACAATAATT 
CTGTCGTAAG TGCTCACTTT TAAGTTGTTC CGATCGTT 

(2) INFORMATION FOR SEQ ID NO: 2: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Lys Val Asn Ser Gin Lys Tyr Ser Lys Ala Leu Leu Glu Val 
1 5 10 15 

Ala Arg Glu Lys Gly Gin Leu Glu Ala He Leu Thr Glu Val Ser Glu 
20 25 30 

Met He Gin Leu Phe Lys Glu Asn Asn Leu Gly Ala Phe Leu Ala Asn 
35 40 45 

Glu Val Tyr Ser Phe Ser Ala Lys Ser Glu Leu He Asp Thr Leu Leu 
50 55 60 

Gin Thr Ser Ser Glu Val Met Ser Asn Phe Leu Asn Thr He Arg Ser 
65 70 75 80 

Asn Gly Arg Leu Ala Asp Leu Gly Glu He Leu Glu Glu Thr Lys Asn 
85 ^O 95 

Ala Ala Asp Asp Met Phe Lys lie Ala Asp Val Glu Val Val Ser Ser 
100 105 HO 
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He Ala Leu Ser Glu Ala Gin He 
115 120 

Ser Lys Phe Asp Leu Asn Glu Val 
130 135 

Lys He Leu Gly Gly Phe He Val 
145 150 

Ala Ser Leu Lys Thr Gin Leu Ala 
165 



Glu Lys Phe Lys Ala Met Ala Lys 

125 

Thr Val He Asn Thr Val Asn Glu 
140 

Asn Ser Arg Gly Lys He He Asp 
155 160 

Lys He Ala Ala Glu He Leu 
170 175 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 500 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Leu Ala He Lys Ala Asn Glu He Ser Ser Leu He Lys Lys Gin He 
1 5 10 I 5 

Glu Asn Phe Thr Pro Asp Phe Glu Val Ala Glu Thr Gly Val Val Thr 
20 25 30 

Tvr Val Gly Asp Gly He Ala Arg Ala Tyr Gly Leu Glu Asn Ala Met 
35 40 45 

Ser Gly Glu Leu Val Glu Phe Ser Asn Gly He Leu Gly Met Ala Gin 
50 55 60 

Asn Leu Asp Ala Thr Asp Val Gly He He Val Leu Gly Asp Phe Leu 
65 70 75 80 

Ser He Arg Glu Gly Asp Thr Val Lys Arg Thr Gly Lys He Met Glu 
85 90 95 

He Gin Val Gly Glu Glu Leu He Gly Arg Val Val Asn Pro Leu Gly 
100 105 HO 

Gin Pro Val Asp Gly Leu Gly Glu Leu Asn Thr Gly Lys Thr Arg Pro 
115 120 125 

Val Glu Ala Lys Ala Pro Gly Val Met Gin Arg Lys Ser Val Ser Glu 
130 135 140 

Pro Leu Gin Thr Gly Leu Lys Ala He Asp Ala Leu Val Pro He Gly 
145 150 155 160 

Ara Glv Gin Arg Glu Leu He He Gly Asp Arg Gin Thr Gly Lys Thr 
165 170 175 

Ser Val Ala He Asp Ala He Leu Asn Gin Lys Gly Gin Asp Met He 
1B0 185 190 

Cys He Tyr Val Ala He Gly Gin Lys Glu Ser Thr Val Arg Thr Gin 
195 200 205 
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Val Glu Thr Leu Arg Lys Leu Gly Ala Met Asp Tyr Thr He Val Val 
210 215 220 

Thr Ala Ser Ala Ser Gin Pro Ser Pro Leu Leu Tyr He Ala Pro Tyr 
225 230 235 240 

Ala Gly Ala Ala Met Gly Glu Glu Phe Met Tyr Asn Gly Lys His Val 
245 250 255 

Leu Val Val Tyr Asp Asp Leu Ser Lys Gin Ala Val Ala Tyr Arg Glu 
260 265 270 

Leu Ser Leu Leu Leu Arg Arg Pro Pro Gly Arg Glu Ala Tyr Pro Gly 
275 280 285 

Asp Val Phe Tyr Leu His Ser Arg Leu Leu Glu Arg Ala Ala Lys Leu 
290 295 300 

Ser Asp Asp Leu Gly Gly Gly Ser Met Thr Ala Leu Pro Phe He Glu 
305 310 315 320 

Thr Gin Ala Gly Asp He Ser Ala Tyr He Pro Thr Asn Val He Ser 
325 330 335 

He Thr Asp Gly Gin He Phe Leu Glu Asn Asp Leu Phe Tyr Ser Gly 
340 345 350 

Val Arg Pro Ala He Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly 
355 360 365 

Ala Ala Gin He Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu 
370 375 380 

Asp Leu Ala Ser Phe Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser 
385 390 395 400 

Asp Leu Asp Glu Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr 
405 410 415 

Val Glu Val Leu Lys Gin Pro Leu His Lys Pro Leu Ala Val Glu Lys 
420 425 430 

Gin Val Leu He Leu Tyr Ala Leu Thr His Gly His Leu Asp Asn Val 
435 440 445 

Pro Val Asp Asp Val Leu Asp Phe Glu Thr Lys Met Phe Asp Phe Phe 
450 455 460 

Asp Ala Asn Tyr Ala Asp Leu Leu Asn Val He Thr Asp Thr Lys Asp 
465 470 475 460 

Leu Pro Glu Glu Ala Lys Leu Asp Glu Ala He Lys Ala Phe Lys Asn 
485 490 495 

Thr Thr Asn Tyr 
500 



(2) INFORMATION FOR SEQ ID NO: 4: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE : amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Ser Leu Asn Glu lie Lys Thr Lys lie Ala Ser Thr Lys 
1 5 10 15 

Lys Thr Ser Gin lie Thr Gly Ala Met Gin Met Val Ser Ala Ala Lys 
20 25 30 

Leu Gin Lys Ala Glu Ser His Ala Lys Ala Phe Gin Thr Tyr Ala Glu 
35 40 45 

Lys Val Arg Lys lie Thr Thr Asp Leu Val Ser Ser Asp Asn Glu Pro 
50 55 60 

Ala Lys Asn Pro Met Met lie Lys Arg Glu Val Lys Lys Thr Gly Tyr 
65 70 75 80 

Leu Val lie Thr Ser Asp Arg Gly Leu Val Gly Ser Tyr Asn Ser Asn 
85 90 95 

lie Leu Lys Ser Val lie Ser Asn He Arg Lys Arg His Thr Asn Glu 
100 105 HO 

Ser Glu Tyr Thr He Leu Ala Leu Gly Gly Thr Gly Ala Asp Phe Phe 
115 120 125 

Lys Ala Arg Asn Val Lys Val Ser Tyr Val Leu Arg Gly Leu Ser Asp 
130 135 140 

Gin Pro Thr Phe Glu Glu Val Arg Ala He Val Thr Glu Ala Val Glu 
145 150 155 160 

Glu Tyr Gin Ala Glu Glu Phe Asp Glu Leu Tyr Val Cys Tyr Asn His 
165 170 175 

His Val Asn Ser Leu Val Ser Glu Ala Arg Met Glu Lys Met Leu Pro 
180 185 190 

He Ser Phe Asp Glu Lys Gly Asp Glu Lys Ala Ser Leu Val Thr Phe 
195 200 205 

Glu Leu Glu Pro Asp Arg Glu Thr He Leu Asn Gin Leu Leu Pro Gin 
210 215 220 

Tyr Ala Glu Ser Met He Tyr Gly Ser He Val Asp Ala Lys Thr Ala 
225 230 235 240 

Glu His Ala Ala Gly Met Thr Ala Met Arg Thr Ala Thr Asp Asn Ala 
245 250 255 

His Ser Val He Asn Asp Leu Thr He Gin Tyr Asn Arg Ala Arg Gin 
260 265 270 

Ala Ser He Thr Gin Glu He Thr Glu He Val Ala Gly Ala Ser Ala 
275 280 285 

Leu 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Leu Ser Ser Gly Lys lie Thr Gin Val lie Gly Pro Val Val Asp Val 
15 10 15 

Glu Phe Gly Ser Asp Ala Lys Leu Pro Glu lie Asn Asn Ala Leu lie 
20 25 30 

Val Tyr Lys Asp Val Asn Gly Leu Lys Thr Lys lie Thr Leu Glu Val 
35 40 45 

Ala Leu Glu Leu Gly Asp Gly Ala Val Arg Thr lie Ala Met Glu Ser 
50 55 60 

Thr Asp Gly Leu Thr Arg Gly Leu Glu Val Leu Asp Thr Gly Lys Ala 
65 70 75 80 

Val Ser Val Pro Val Gly Glu Ser Thr Leu Gly Arg Val Phe Asn Val 
B5 90 95 

Leu Gly Asp Val lie Asp Gly Gly Glu Asp Phe Pro Ala Asp Ala Glu 
100 105 110 

Arg Asn Pro lie His Lys Lys Ala Pro Thr Phe Asp Glu Leu Ser Thr 
115 120 125 

Ala Asn Glu Val Leu Val Thr Gly He Lys Val Val Asp Leu Leu Ala 
130 135 140 

Pro Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val 
145 150 155 160 

Gly Lys Thr Val Leu He Gin Glu Leu He His Asn He Ala Gin Glu 
165 170 175 

His Gly Gly He Ser Val Phe Thr Gly Val Gly Asp Arg Thr Arg Asp 
180 185 190 

Gly Asn Asp Leu Tyr Trp Glu Met Lys Glu Ser Gly Val He Glu Lys 
195 200 205 

Thr Ala Met Val Phe Gly Gin Met Asn Glu Pro Pro Gly Ala Arg Met 
210 215 220 

Arg Val Ala Leu Thr Gly Leu Thr He Ala Glu Tyr Phe Arg Asp Val 
225 230 235 240 

Gin Gly Gin Asp Val Leu Leu Phe He Asp Asn He Phe Arg Phe Thr 
245 250 255 

Gin Ala Gly Ser Glu Val Ser Ala Leu Trp Gly Arg Met Pro Ser Ala 
260 265 270 

Val Gly Tyr Gin Pro Thr Leu Ala Thr Glu Met Val Gin Leu Gin Glu 
275 280 285 
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Arg lie Thr Ser Thr Lys Lys Gly Ser Val Thr Ser He Pro Ala He 
290 295 300 

Tyr Val Pro Ala Asp Asp Tyr Thr Asp Pro Ala Pro Ala Thr Ala Phe 
305 310 315 320 

Ala His Leu Asp Ala Thr Thr Asn Leu Glu Arg Arg Leu Thr Gin Met 
325 330 335 

Gly He Tyr Pro Ala Val Asp Pro Leu Ala Ser Ser Ser Arg Ala Leu 
340 345 350 

Thr Pro Glu He Val Gly Glu Glu His Tyr Glu Val Ala Met Glu Val 
355 360 365 

Gin Arg Val Leu Gin Arg Tyr Lys Glu Leu Gin Asp He He Ala He 
370 375 380 

Leu Gly Met Asp Glu Leu Ser Asp Asp Glu Lys He Leu Val Gly Arg 
385 390 395 400 

Ala Arg Arg He Gin Phe Phe Leu Ser Gin Asn Phe His Val Ala Glu 
405 410 415 

Gin Phe Thr Gly Gin Pro Gly Ser Tyr Val Pro He Asp Lys Thr Val 
420 425 430 

His Asp Phe Lys Glu He Leu Glu Gly Lys Tyr Asp Glu Val Pro Glu 
435 440 445 

Asp Ala Phe Arg Gly Val Gly Pro He Glu Asp Val Leu Ala Lys Ala 
450 455 460 

Lys Ser Met Gly Tyr 
465 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2207 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis subsp. lactis 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATIONS . .633 

(D) OTHER INFORMATION: /partial 
/codon_start= 4 

/product= "ATPase subunit, partial sequence" 
/gene= "atpA" 

/standard_name= "alpha subunit of the Fl portion 
of the F0F1 ATPase" 
/label= alpha-subunit 
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(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 652. . 1518 

(D) OTHER INFORMATION: /codon_start= 652 
/product* 3 "ATPase subunit" 
/gene= "atpG" 

/ standard_name= "gamma subunit of the Fl portion 
of the F0F1 ATPase" 
/label= gamma- subunit 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1654 . .2205 

(D) OTHER INFORMATION : /partial 
/ codon_s tart- 1654 

/product 13 "ATPase subunit, partial sequence" 
/gene= "atpD" 

/standard_name= "beta subunit of the Fl portion of 
the F0F1 ATPase" 
/label= beta-subunit 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TGA TTC TAC TTA CAT TCA CGT CTT TTG GAA CGT GCT GCC AAA TTA TCT 4 8 

Phe Tyr Leu His Ser Arg Leu Leu Glu Arg Ala Ala Lys Leu Ser 
470 475 480 

GAC TAT CTT GGT GGT GGT TCA ATG ACT GCA CTG CCA TTC ATT GAA ACA 96 
Asp Tyr Leu Gly Gly Gly Ser Met Thr Ala Leu Pro Phe lie Glu Thr 
485 490 495 500 

CAA GCC GGA GAT ATC TCA GCT TAT ATT GCA ACA AAC GTT ATC TCT ATT 144 
Gin Ala Gly Asp lie Ser Ala Tyr lie Ala Thr Asn Val He Ser He 
505 510 515 

ACT GAC GGT CAA ATT TTC CTT GAA AAT GAC TTA TTC TAT TCA GGT GTA 192 
Thr Asp Gly Gin He Phe Leu Glu Asn Asp Leu Phe Tyr Ser Gly Val 
520 525 530 

CGT CCT GCC ATC GAT GCT GGT TCT TCA GTT TCT CGG GTT GGT GGT GCT 24 0 

Arg Pro Ala He Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly Ala 
535 540 545 

GCA CAG ATC AAA GCC ATG AAG AAA GTT GCT GGT ACT TTG CGT CTT GAC 28 8 

Ala Gin He Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu Asp 
550 555 560 

CTT GCG TCA TTC CGT GAA CTT GAA GCC TTT ACT CAA TTT GGT TCT GAT 336 
Leu Ala Ser Phe Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser Asp 
565 570 575 580 

CTT GAT GAA GCG ACT CAA GCA AAA TTG AAT CGT GGT CGT CGT ACC GTT 384 
Leu Asp Glu Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr Val 
585 590 595 

GAA GTT TTG AAG CAA CCA TTG CAC AAA CCA TTG GCT GTT GAA AAA CAA 4 32 

Glu Val Leu Lys Gin Pro Leu His Lys Pro Leu Ala Val Glu Lys Gin 
600 605 610 

GTT TTA ATT CTT TAT GCA TTG ACT CAT GGT CAC TTG GAT GAT GTT CCA 480 
Val Leu He Leu Tyr Ala Leu Thr His Gly His Leu Asp Asp Val Pro 
615 620 625 



WO 98/10089 



48 



PCT/DK97/00373 



GTT GAT GAC GTC CTT GAT TTT GAA ACA AAC AAT GTC CGA TTC TTC GAT 528 
Val Asp Asp Val Leu Asp Phe Glu Thr Asn Asn Val Arg Phe Phe Asp 
630 635 640 

GCA AAT TAT GCA AAA CTC TTG AAC GTG ATT ACT GAA ACT AAA GAT TGC 576 
Ala Asn Tyr Ala Lys Leu Leu Asn Val lie Thr Glu Thr Lys Asp Cys 
645 650 655 660 

CAG AAG AAG GAA AAC TCG ACG AAG GAA TTA AAG CAT TCT AAA ATA CAA 624 
Gin Lys Lys Gin Asn Ser Thr Lys Gin Leu Lys His Ser Lys lie Gin 
665 670 675 

CGA ATT ATT AATAAGGAGG CTAATCTA ATG GGA GCT TCA CTT AAT GAA ATA 675 
Arg He He Met Gly Ala Ser Leu Asn Glu He 

1 5 

AAA ACT AAG ATT GCC TCA ACG AAG AAA ACA AGT CAA ATA ACT GGA GCC 723 
Lys Thr Lys He Ala Ser Thr Lys Lys Thr Ser Gin He Thr Gly Ala 
10 15 20 

ATG CAA ATG GTT TCC GCT GCG AAA CTT CAA AAA GCT GAA TCT CAT GCC 771 
Met Gin Met Val Ser Ala Ala Lys Leu Gin Lys Ala Glu Ser 1 His Ala 
25 30 35 40 

AAA GCA TTT CAA ATT TAT GCT GAA AAA GTT CGT AAA ATT ACA ACT GAT 819 
Lys Ala Phe Gin He Tyr Ala Glu Lys Val Arg Lys He Thr Thr Asp 
45 50 55 

TTA GTT TCC TCT GAC AAA GAG CCA GCT AAG AAT CCA ATG ATG ATA GGA 867 
Leu Val Ser Ser Asp Lys Glu Pro Ala Lys Asn Pro Met Met He Gly 
60 65 70 

AGA GAA GTC AAA AAA ACT GGC TAT CTT GTA ATT ACT TCG GAT CGT GGA 915 
Arg Glu Val Lys Lys Thr Gly Tyr Leu Val He Thr Ser Asp Arg Gly 
75 80 85 

CTT GTC GGT GGC TAT AAT TCA TAT ATT TTG AAA TCT GTC ATG AAT ACT 963 
Leu Val Gly Gly Tyr Asn Ser Tyr He Leu Lys Ser Val Met Asn Thr 
90 95 100 

ATC CGT AAA CGT CCT GCT AAT GAA AGT GAA TAT ACT ATT CTT GCA CTT 1011 
He Arg Lys Arg Pro Ala Asn Glu Ser Glu Tyr Thr He Leu Ala Leu 
105 HO 115 120 

GGC GGT ACT GGA GCA GAT TTC TTC GGA GCA AGC AAT GTT AAA AGT TTC 1059 
Gly Gly Thr Gly Ala Asp Phe Phe Gly Ala Ser Asn Val Lys Ser Phe 
125 130 135 

TTA GTC CTT TGT GGT TTT TCA GAC CAA CCA AAT TTT GAA GAA GTT AGA 1107 
Leu Val Leu Cys Gly Phe Ser Asp Gin Pro Asn Phe Glu Glu Val Arg 
140 145 150 

GCG ATT GTT ACA GAA GCG GTA ACT GAA TAT CAA GCA GAA GAA TTT GAT 1155 
Ala He Val Thr Glu Ala Val Thr Glu Tyr Gin Ala Glu Glu Phe Asp 
155 160 165 

GAA CTT TAT GTT TGC TAT AAT CAC CAT GTG AAC TCA TTG GTA AGT GAA 1203 
Glu Leu Tyr Val Cys Tyr Asn His His Val Asn Ser Leu Val Ser Glu 
170 175 180 

GCA AGT ATG GAA AAA ATG TTG CCT ATT TTT TTT GAA GCA TCA GGT CAA 1251 
Ala Ser Met Glu Lys Met Leu Pro He Phe Phe Glu Ala Ser Gly Gin 
185 190 195 200 
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CAA AAA CCA TTT TTT GAA ACA TTT GAA TTA GAA CCA GAT TGT GAA ACA 1299 
Gin Lys Pro Phe Phe Glu Thr Phe Glu Leu Glu Pro Asp Cys Glu Thr 
205 210 215 

ATT TTA AAC CAA TTG TTG CCA CCA TAC GCT GAA AGT ATG ATT TAT GGT 1347 
He Leu Asn Gin Leu Leu Pro Pro Tyr Ala Glu Ser Met He Tyr Gly 
220 225 230 

TCA ATC GTT GAT GCT AAG ACA GCA GAA CAT GCT GCA GGT ATG ACA GCA 1395 
Ser He Val Asp Ala Lys Thr Ala Glu His Ala Ala Gly Met Thr Ala 
235 240 245 

ATG CGT ACT GCA ACT GAT AAT GCT CAC TCT GTT ATC AAT GAT TTG ACT 14 4 3 

Met Arg Thr Ala Thr Asp Asn Ala His Ser Val He Asn Asp Leu Thr 
250 255 260 

ATT CAA TAC AAC CGT GCT CGT CAA GCA TCG ATT ACG CAA GAA ATT ACG 14 91 

He Gin Tyr Asn Arg Ala Arg Gin Ala Ser He Thr Gin Glu He Thr 
265 270 275 2B0 

GAA ATC GTT GCA GGA GCC TCA GCG CTT TAATTTACTG ATAGGAATTC 1538 
Glu He Val Ala Gly Ala Ser Ala Leu 
285 

TGTCAGTGAT GGCTTTGAAT CTTAATTGTT TTTGTCAGTA AAATTTTTAC TGACAAACAT 1598 

AAAAATGAAT AGAAATTCTG TTCTTTGACA GAAAATAAAA ACAGGAGGAA AAACA TTG 1656 

Leu 
1 

AGT TCT GGT AAA ATT ACT CAG ATT ATC GGT CCC GTC GTT GAC GTG GAA 1704 
Ser Ser Gly Lys He Thr Gin He He Gly Pro Val Val Asp Val Glu 
5 10 15 

TTT GGT TCT GAT GCC AAA TTG CCT GAG ATT AAC AAT GCC TTG ATT GTC 17 52 

Phe Gly Ser Asp Ala Lys Leu Pro Glu He Asn Asn Ala Leu lie Val 
20 25 30 

TAC AAA GAT GTC AAT GGC CTA AAA ACA AAA ATT ACT CTT GAA GTT GCT 18 00 

Tyr Lys Asp Val Asn Gly Leu Lys Thr Lys He Thr Leu Glu Val Ala 
35 40 45 

TTG GAA CTT GGT GAT GGT GCA GTT CGT ACA ATC GCT ATG GAA TCT ACT 184 8 

Leu Glu Leu Gly Asp Gly Ala Val Arg Thr He Ala Met Glu Ser Thr 
50 55 60 65 

GAT GGC TTG ACT CGT GGA CTT GAA GTC CTT GAT ACA GGT AAA GCA GTC 1896 
Asp Gly Leu Thr Arg Gly Leu Glu Val Leu Asp Thr Gly Lys Ala Val 
70 75 80 

AGC GTT CCT GTT GGG GAA GCC ACT CTT GGT CGT GTT TTT AAC GTC CTT 194 4 

Ser Val Pro Val Gly Glu Ala Thr Leu Gly Arg Val Phe Asn Val Leu 
85 90 95 

GGT GAT GTT ATT GAC GGT GGG GAA GAA TTT GCT GCT GAT GCA GAA CGT 1992 
Gly Asp Val He Asp Gly Gly Glu Glu Phe Ala Ala Asp Ala Glu Arg 
100 105 110 

AAT CCT ATC CAT AAA AAA GCT CCA ACA TTT GAC GAA TTG TCA ACT GCA 2040 
Asn Pro He His Lys Lys Ala Pro Thr Phe Asp Glu Leu Ser Thr Ala 
115 120 125 
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AAC GAA GTT CTC GTA ACT GGG ATT AAA GTT GTC GAT TTG CTT GCA CCT 2088 
Asn Glu Val Leu Val Thr Gly lie Lys Val Val Asp Leu Leu Ala Pro 
130 135 140 145 

TAC CTT AAA GGT GGT AAA GTT GGA CTT TTC GGT GGT GCC GGA GTT GGT 2136 
Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val Gly 
150 155 160 

AAA GCC GTC CTT ATT CAA GAA TTG AAA CAC AAC ATC GCC CAA GAA CAC 2184 
Lys Ala Val Leu lie Gin Glu Leu Lys His Asn lie Ala Gin Glu His 
165 170 175 

GGA GGT ATT TCT GTG TTT ACC GG 2207 
Gly Gly lie Ser Val Phe Thr 
160 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Phe Tyr Leu His Ser Arg Leu Leu Glu Arg Ala Ala Lys Leu Ser Asp 
15 10 15 

Tyr Leu Gly Gly Gly Ser Met Thr Ala Leu Pro Phe lie Glu Thr Gin 
20 25 30 

Ala Gly Asp lie Ser Ala Tyr lie Ala Thr Asn Val He Ser He Thr 
35 40 45 

Asp Gly Gin He Phe Leu Glu Asn Asp Leu Phe Tyr Ser Gly Val Arg 
50 55 60 

Pro Ala He Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly Ala Ala 
65 70 75 80 

Gin He Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu Asp Leu 
85 90 95 

Ala Ser Phe Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser Asp Leu 
100 105 110 

Asp Glu Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr Val Glu 
115 120 125 

Val Leu Lys Gin Pro Leu His Lys Pro Leu Ala Val Glu Lys Gin Val 
130 135 140 

Leu He Leu Tyr Ala Leu Thr His Gly His Leu Asp Asp Val Pro Val 
145 150 155 160 

Asp Asp Val Leu Asp Phe Glu Thr Asn Asn Val Arg Phe Phe Asp Ala 
165 170 175 

Asn Tyr Ala Lys Leu Leu Asn Val He Thr Glu Thr Lys Asp Cys Gin 
180 185 190 
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Lys Lys Gin Asn Ser Thr Lys Gin Leu Lys His Ser Lys lie Gin Arg 
195 200 205 

He He 
210 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Gly Ala Ser Leu Asn Glu He Lys Thr Lys He Ala Ser Thr Lys 
15 10 15 

Lys Thr Ser Gin He Thr Gly Ala Met Gin Met Val Ser Ala Ala Lys 
20 25 30 

Leu Gin Lys Ala Glu Ser His Ala Lys Ala Phe Gin He Tyr Ala Glu 
35 40 45 

Lys Val Arg Lys He Thr Thr Asp Leu Val Ser Ser Asp Lys Glu Pro 
50 55 60 

Ala Lys Asn Pro Met Met He Gly Arg Glu Val Lys Lys Thr Gly Tyr 
65 70 75 80 

Leu Val He Thr Ser Asp Arg Gly Leu Val Gly Gly Tyr Asn Ser Tyr 
85 90 95 

He Leu Lys Ser Val Met Asn Thr He Arg Lys Arg Pro Ala Asn Glu 
100 105 110 

Ser Glu Tyr Thr He Leu Ala Leu Gly Gly Thr Gly Ala Asp Phe Phe 
115 120 125 

Gly Ala Ser Asn Val Lys Ser Phe Leu Val Leu Cys Gly Phe Ser Asp 
130 135 140 

Gin Pro Asn Phe Glu Glu Val Arg Ala He Val Thr Glu Ala Val Thr 
145 150 155 160 

Glu Tyr Gin Ala Glu Glu Phe Asp Glu Leu Tyr Val Cys Tyr Asn His 
165 170 175 

His Val Asn Ser Leu Val Ser Glu Ala Ser Met Glu Lys Met Leu Pro 
180 185 190 

He Phe Phe Glu Ala Ser Gly Gin Gin Lys Pro Phe Phe Glu Thr Phe 
195 200 205 

Glu Leu Glu Pro Asp Cys Glu Thr He Leu Asn Gin Leu Leu Pro Pro 
210 215 220 

Tyr Ala Glu Ser Met He Tyr Gly Ser He Val Asp Ala Lys Thr Ala 
225 230 235 240 
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Glu His Ala Ala Gly Met Thr Ala Met Arg Thr Ala Thr Asp Asn Ala 
245 250 255 

His Ser Val lie Asn Asp Leu Thr He Gin Tyr Asn Arg Ala Arg Gin 
260 265 270 

Ala Ser He Thr Gin Glu He Thr Glu He Val Ala Gly Ala Ser Ala 
275 260 285 

Leu 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Leu Ser Ser Gly Lys He Thr Gin He He Gly Pro Val Val Asp Val 
1 5 10 15 

Glu Phe Gly Ser Asp Ala Lys Leu Pro Glu He Asn Asn Ala Leu He 
20 25 30 

Val Tyr Lys Asp Val Asn Gly Leu Lys Thr Lys He Thr Leu Glu Val 
35 40 45 

Ala Leu Glu Leu Gly Asp Gly Ala Val Arg Thr He Ala Met Glu Ser 
50 55 60 

Thr Asp Gly Leu Thr Arg Gly Leu Glu Val Leu Asp Thr Gly Lys Ala 
65 70 75 80 

Val Ser Val Pro Val Gly Glu Ala Thr Leu Gly Arg Val Phe Asn Val 
85 90 95 

Leu Gly Asp Val He Asp Gly Gly Glu Glu Phe Ala Ala Asp Ala Glu 
100 105 110 

Arg Asn Pro He His Lys Lys Ala Pro Thr Phe Asp Glu Leu Ser Thr 
115 120 125 

Ala Asn Glu Val Leu Val Thr Gly He Lys Val Val Asp Leu Leu Ala 
130 135 140 

Pro Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val 
145 150 15S 160 

Gly Lys Ala Val Leu He Gin Glu Leu Lys His Asn He Ala Gin Glu 
165 170 175 

His Gly Gly He Ser Val Phe Thr 
180 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2161 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus thermophilus 

(B) STRAIN: ST3 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION : 2 . .637 

(D) OTHER INFORMATION: /partial 
/codon_start= 2 

/product^ "ATPase subunit, partial sequence" 
/gene= "atpA" 

/standard_name= "alpha subunit of the Fl portion 
of the F0F1 ATPase" 
/label= alpha-subunit 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 659. .1537 

(D) OTHER INFORMATION: /codon_start= 659 
/product^ "ATPase subunit" 
/gene= "atpG" 

/standard_name= "gamma subunit of the Fl portion 
of the F0F1 ATPase" 
/label= gamma- subunit 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1616. .2161 

(D) OTHER INFORMATION: /partial 
/codon_start= 1616 

/product= "ATPase subunit, partial sequence" 
/gene= "atpD" 

/standard_name= "beta subunit of the Fl portion of 
the F0F1 ATPase" 
/label= beta-subunit 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

T GAT TCT CAT CTC CAC TCA CGT CTT TTG GAA CGT TCA GCT AAG CTT 4 6 

Asp Ser His Leu His Ser Arg Leu Leu Glu Arg Ser Ala Lys Leu 
185 190 195 

TCA GAT GAT CTT GGT GGT GGT TCA ATG ACT GCC TTG CCA ATC ATC CAA 94 
Ser Asp Asp Leu Gly Gly Gly Ser Met Thr Ala Leu Pro He He Gin 
200 205 210 215 

ACA CAA GCA GGA GAT ATC TCA GCT TAT ATC GCG ACA AAC GTT ATT TCT 142 
Thr Gin Ala Gly Asp He Ser Ala Tyr He Ala Thr Asn Val He Ser 
220 225 230 
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ATC ACA GAT GGA CAA ATC TTC TTG CAA GAA AAT CTT TTC AAC TCA GGT 190 
lie Thr Asp Gly Gin lie Phe Leu Gin Glu Asn Leu Phe Asn Ser Gly 
235 240 245 

ATT CGT CCT GCG ATT GAT GCT GGT TCT TCA GTA TCA CGT GTT GGT GGT 238 
lie Arg Pro Ala lie Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly 
250 255 260 

TCA GCA CAA ATC AAA GCA ATG AAG AAA GTT GCT GGT ACC CTT CGT CTT 286 
Ser Ala Gin lie Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu 
265 270 275 

GAC TTG GCT TCT CAC CGT GAA CTT GAA GCC TTT ACA CAA TTC GGT TCT 334 
Asp Leu Ala Ser His Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser 
280 285 290 295 

GAT TTG GAT GCC GCA ACA CAA GCT AAA CTT AAT CGT GGA CGT CGT ACA 382 
Asp Leu Asp Ala Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr 
300 305 310 

GTT GAA GTG CTT AAA CAA CCA CTT CAT AAC CCA CTT CCG GTT GAA AAA 4 30 

Val Glu Val Leu Lys Gin Pro Leu His Asn Pro Leu Pro Val Glu Lys 
315 320 325 

CAA GTT CTT ATT CTT TAC GCT TTG ACA CAT GGC TTC TTG GAC AGT GTT 478 
Gin Val Leu lie Leu Tyr Ala Leu Thr His Gly Phe Leu Asp Ser Val 
330 335 340 

CCG GTT GAT CAA ATC TTG GAT TTT GAA GAA GCC CTC TAT GAC TAC TTT 526 
Pro Val Asp Gin lie Leu Asp Phe Glu Glu Ala Leu Tyr Asp Tyr Phe 
345 350 355 

GAT AGC CAT CAT GAG GAT ATC TTT GAA ACA ATC CGT TCA ACT AAG GAT 574 
Asp Ser His His Glu Asp lie Phe Glu Thr He Arg Ser Thr Lys Asp 
360 365 370 375 

CTT CCT GAA GAA GCT GTG CTT AAT GAA GCT ATC CAA GCT TTC AAA GAT 622 
Leu Pro Glu Glu Ala Val Leu , Asn Glu Ala He Gin Ala Phe Lys Asp 
380 385 390 

CAA TCG GAA TAC AAA TAGAGATAGG GAG GAC AGC A T ATG GCA GGC TCT CTA 67 3 

Gin Ser Glu Tyr Lys Met Ala Gly Ser Leu 

395 1 5 

AGA GAA ATC AAA GCA AAA ATT GCT TCA ATT AAG CAA ACG AGT CAT ATT 721 
Arg Glu He Lys Ala Lys He Ala Ser He Lys Gin Thr Ser His He 
10 15 20 

ACA GGA GCC ATG CAA ATG GTT TCT GCT TCT AAA TTG ACA CGT TCT GAG 769 
Thr Gly Ala Met Gin Met Val Ser Ala Ser Lys Leu Thr Arg Ser Glu 
25 30 35 

CAG GCT GCT AAA GAT TTC CAA ATC TAT GCC TCA AAA ATT AGA CAG ATC 817 
Gin Ala Ala Lys Asp Phe Gin He Tyr Ala Ser Lys He Arg Gin He 
40 45 50 

ACA ACA GAT CTT CTA CAT TCA GAA TTG GTT AAT GGT TCT TCA AAT CCG 865 
Thr Thr Asp Leu Leu His Ser Glu Leu Val Asn Gly Ser Ser Asn Pro 
55 60 65 

ATG TTG GAT GCA CGT CCA GTT CGT AAG TCA GGG TAT ATT GTC ATT ACT 913 
Met Leu Asp Ala Arg Pro Val Arg Lys Ser Gly Tyr He Val He Thr 
70 75 80 85 



WO 98/10089 



55 



PCT/DK97/00373 



TCA GAT AAG GGA TTA GTT GGA GGA TAT AAT TCA ACC ATT CTT AAA GCT 961 

Ser Asp Lys Gly Leu Val Gly Gly Tyr Asn Ser Thr He Leu Lys Ala 
90 95 100 

GTC TTG GAT ATG ATT AAA CGT GAC CAT GAT TCT GAA GAT GAA TAT GCT 1009 

Val Leu Asp Met He Lys Arg Asp His Asp Ser Glu Asp Glu Tyr Ala 
105 110 115 

ATC ATC TCT ATT GGT GGA ACA GGT TCA GAT TTC TTC AAA GCT CGT AAC 1057 

He He Ser He Gly Gly Thr Gly Ser Asp Phe Phe Lys Ala Arg Asn 
120 125 130 

ATG AAT GTT GCT TTT GAA CTT CGT GGC CTT GAA GAT CAA CCT AGT TTC 1105 

Met Asn Val Ala Phe Glu Leu Arg Gly Leu Glu Asp Gin Pro Ser Phe 
135 140 145 

GAT CAA GTC GGG GAA ATC ATT CTA AAA GCT GTA GGA ATG TAT CAA AAT 1153 

Asp Gin Val Gly Glu He He Leu Lys Ala Val Gly Met Tyr Gin Asn 

150 155 160 165 

GAG CTT TTT GAT GAA CTT TAT GTG TGT TAC AAT CAT CAT ATT AAT AGT 1201 

Glu Leu Phe Asp Glu Leu Tyr Val Cys Tyr Asn His His He Asn Ser 
170 175 180 

TTG TTT TGT GAA GTT TGT GTT GAA AAA ATG CTT CCA ATT GCT GAT TTT 1249 

Leu Phe Cys Glu Val Cys Val Glu Lys Met Le.u Pro He Ala Asp Phe 
185 190 195 

GAT CCT AAT GAA TTT GAA GGC CAT GTA TTG ACC AAG TTT GAA TTG GAA 1297 

Asp Pro Asn Glu Phe Glu Gly His Val Leu Thr Lys Phe Glu Leu Glu 
200 205 210 

CCA AGT TGT GAT ACT ATT TTG GAT CAA CTT TTG CCC ACA ATA GTC GGT 1345 

Pro Ser Cys Asp Thr He Leu Asp Gin Leu Leu Pro Thr He Val Gly 
215 220 225 

GAG AGT TTT ATC TAC GGT GCT ATC GTA GAT GCC AAA ACA GCT GAG CAT 1393 

Glu Ser Phe He Tyr Gly Ala He Val Asp Ala Lys Thr Ala Glu His 

230 235 240 245 

GCT GCT GGT ATG ACC GCA ATG CAG ACT GCC ACT GAT AAT GCT AAG AAA 14 41 

Ala Ala Gly Met Thr Ala Met Gin Thr Ala Thr Asp Asn Ala Lys Lys 
250 255 260 

ATA ATT AAC GAT TTA ACA ATT CAA TAC AAC CGT GCA CGT CAA GCA GCC 14 89 

He He Asn Asp Leu Thr He Gin Tyr Asn Arg Ala Arg Gin Ala Ala 
265 270 275 

ATT ACT CAG GAA ATC ACT GAG ATT GTT GGC GGT GCT AGT GCA CTT GAA 1537 

He Thr Gin Glu He Thr Glu He Val Gly Gly Ala Ser Ala Leu Glu 
280 285 290 

TAGCTAGAGA TTTGTCTTGA TTTGACATAC AATAAAAAGG GATGATTGTC ATCCAGAAAA 1597 

CTT CAT AAG G AGAAAACA ATG AGC TCA GGC AAA ATT GCT CAG GTT GTT GGT 164 8 

Met Ser Ser Gly Lys He Ala Gin Val Val Gly 
15 10 

CCT GTT GTA GAC GTA GCG TTT GCA ACT GGC GAT AAA CTT CCT GAG ATT 1696 

Pro Val Val Asp Val Ala Phe Ala Thr Gly Asp Lys Leu Pro Glu He 
15 20 25 
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AAC AAT GCA TTG GTC GTT TAC ACT GAG AAG AAA AGT CTT AGA CGG ATG 174 4 

Asn Asn Ala Leu Val Val Tyr Thr Glu Lys Lys Ser Leu Arg Arg Met 
30 35 40 

GTG CTC GAA GTA GCT TCG TTG AAA CTT GGA GAA GGT GTG GTT CGT ACC 17 92 

Val Leu Glu Val Ala Ser Leu Lys Leu Gly Glu Gly Val Val Arg Thr 
45 50 55 

ATT GCC ATG GAA TCT ACT GAT GGA TTG ACT CGT GGG CTA GAA GTT CTG 1640 
lie Ala Met Glu Ser Thr Asp Gly Leu Thr Arg Gly Leu Glu Val Leu 
60 65 70 75 

GAC ACA GGT CGT CCA ATC AGT GTT CCT GTT GGT AAA GAA CTT CTT GGA 18 88 

Asp Thr Gly Arg Pro lie Ser Val Pro Val Gly Lys Glu Leu Leu Gly 
80 85 90 

CGT GTC TTT AAC GTG CTT GGA GAT ACC ATT GAC ATG GAA GCA CCT TTT 1936 
Arg Val Phe Asn Val Leu Gly Asp Thr He Asp Met Glu Ala Pro Phe 
95 100 105 

GCA GAT GAT GCA GAG CGT GAA CCA ATT CAT AAA AAA GCA CCT ACC TTC 1984 
Ala Asp Asp Ala Glu Arg Glu Pro He His Lys Lys Ala Pro Thr Phe 
110 115 120 

GAT GAA TTG TCA ACA AGT ACT GAA ATC CTT GAA ACA GGG ATT AAA GTT 2032 
Asp Glu Leu Ser Thr Ser Thr Glu He Leu Glu Thr Gly He Lys Val 
125 130 135 

ATC GAC TTG CTT GCC CCT TAT CTT AAA GGT GGT AAA GTC GGA CTT TTC 2080 
He Asp Leu Leu Ala Pro Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe 
140 145 150 155 

GGT GGT GCC GGT GTT GGT AAG GCC GTT CTT ATT CAA GAG CTG AAT CAC 2128 
Gly Gly Ala Gly Val Gly Lys Ala Val Leu He Gin Glu Leu Asn His 
160 165 170 

AAC ATT GCT CAA GAA CAC GGT GGC ATT TCC GTG 2161 
Asn He Ala Gin Glu His Gly Gly lie Ser Val 
175 180 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asp Ser His Leu His Ser Arg Leu Leu Glu Arg Ser Ala Lys Leu Ser 
15 10 15 

Asp Asp Leu Gly Gly Gly Ser Met Thr Ala Leu Pro He He Gin Thr 
20 25 30 

Gin Ala Gly Asp He Ser Ala Tyr He Ala Thr Asn Val He Ser He 
35 40 45 

Thr Asp Gly Gin He Phe Leu Gin Glu Asn Leu Phe Asn Ser Gly He 
50 55 60 
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Arg Pro Ala lie Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly Ser 
65 70 75 80 

Ala Gin He Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu Asp 
85 90 95 

Leu Ala Ser His Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser Asp 
100 105 110 

Leu Asp Ala Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr Val 
115 120 125 

Glu Val Leu Lys Gin Pro Leu His Asn Pro Leu Pro Val Glu Lys Gin 
130 135 140 

Val Leu He Leu Tyr Ala Leu Thr His Gly Phe Leu Asp Ser Val Pro 
145 150 155 160 

Val Asp Gin He Leu Asp Phe Glu Glu Ala Leu Tyr Asp Tyr Phe Asp 
165 170 175 

Ser His His Glu Asp He Phe Glu Thr He Arg Ser Thr Lys Asp Leu 
180 185 190 

Pro Glu Glu Ala Val Leu Asn Glu Ala He Gin Ala Phe Lys Asp Gin 
195 200 205 

Ser Glu Tyr Lys 
210 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala Gly Ser Leu Arg Glu He Lys Ala Lys He Ala Ser He Lys 
15 10 15 

Gin Thr Ser His He Thr Gly Ala Met Gin Met Val Ser Ala Ser Lys 
20 25 30 

Leu Thr Arg Ser Glu Gin Ala Ala Lys Asp Phe Gin He Tyr Ala Ser 
35 40 45 

Lys He Arg Gin He Thr Thr Asp Leu Leu His Ser Glu Leu Val Asn 
50 55 60 

Gly Ser Ser Asn Pro Met Leu Asp Ala Arg Pro Val Arg Lys Ser Gly 
65 70 75 80 

Tyr He Val He Thr Ser Asp Lys Gly Leu Val Gly Gly Tyr Asn Ser 
85 90 95 

Thr He Leu Lys Ala Val Leu Asp Met He Lys Arg Asp His Asp Ser 
100 105 110 
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Glu Asp Glu Tyr Ala He He Ser He Gly Gly Thr Gly Sec Asp Phe 
115 120 125 

Phe Lys Ala Arg Asn Met Asn Val Ala Phe Glu Leu Arg Gly Leu Glu 
130 135 140 

Asp Gin Pro Ser Phe Asp Gin Val Gly Glu He He Leu Lys Ala Val 
145 150 155 160 

Gly Met Tyr Gin Asn Glu Leu Phe Asp Glu Leu Tyr Val Cys Tyr Asn 
165 170 175 

His His He Asn Ser Leu Phe Cys Glu Val Cys Val Glu Lys Met Leu 
180 185 190 

Pro He Ala Asp Phe Asp Pro Asn Glu Phe Glu Gly His Val Leu Thr 
195 200 205 

Lys Phe Glu Leu Glu Pro Ser Cys Asp Thr He Leu Asp Gin Leu Leu 
210 215 220 

Pro Thr He Val Gly Glu Ser Phe He Tyr Gly Ala He Val Asp Ala 
225 230 235 240 

Lys Thr Ala Glu His Ala Ala Gly Met Thr Ala Met Gin Thr Ala Thr 
245 250 255 

Asp Asn Ala Lys Lys He He Asn Asp Leu Thr He Gin Tyr Asn Arg 
260 265 270 

Ala Arg Gin Ala Ala He Thr Gin Glu He Thr Glu He Val Gly Gly 
275 280 285 

Ala Ser Ala Leu Glu 
290 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ser Ser Gly Lys He Ala Gin Val Val Gly Pro Val Val Asp Val 
15 10 15 

Ala Phe Ala Thr Gly Asp Lys Leu Pro Glu He Asn Asn Ala Leu Val 
20 25 30 

Val Tyr Thr Glu Lys Lys Scr Leu Arg Arg Met Val Leu Glu Val Ala 
35 40 45 

Ser Leu Lys Leu Gly Glu Gly Val Val Arg Thr He Ala Met Glu Ser 
50 55 60 

Thr Asp Gly Leu Thr Arg Gly Leu Glu Val Leu Asp Thr Gly Arg Pro 
65 70 75 80 
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lie 



Ser 



Val Pro Val Gly Lys Glu Leu Leu Gly Arg Val Phe Asn Val 
85 90 95 



Leu 



Gly 



Asp Thr lie Asp Met Glu Ala Pro Phe Ala Asp Asp Ala Glu 
100 105 110 



Arg 



Glu 



Pro lie His Lys Lys Ala Pro Thr Phe Asp Glu Leu Ser Thr 
115 120 125 



Ser 



Thr 
130 



Glu lie Leu Glu Thr Gly lie Lys Val He Asp Leu Leu Ala 
135 140 



Pro 
145 



Tyr 



Leu Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val 
150 155 160 



Gly Lys Ala Val Leu lie Gin Glu Leu Asn His Asn He Ala Gin Glu 
165 170 175 

His Gly Gly He Ser Val 
180 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 914 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: C- terminal 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phaffia rhodozyma 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 51 . . 824 

(D) OTHER INFORMATION: /partial 
/codon_start= 51 

/product= "ATPase subunit, partial sequence" 
/gene= "ATP2 " 

/standard_name= "beta subunit of the Fl portion of 
the F0F1 ATPase" 
/label= beta-subunit 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GAATTCTCAA CCTTGAGGGT GACTCCAAGG TCGCTCTTGT CTTCGGACAG ATG AAC 56 



Met Asn 



GAG CCC CCG GGT GCT CGA GCC CGA GTC GCT TTG ACT GGT TTG ACC ATC 
Glu Pro Pro Gly Ala Arg Ala Arg Val Ala Leu Thr Gly Leu Thr He 
185 190 195 200 



104 
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GCC GAG TAC TTC CGA GAC GAG GAA GGA CAG GAT GTC TTG CTT TTC ATC 152 

Ala Glu Tyr Phe Arg Asp Glu Glu Gly Gin Asp Val Leu Leu Phe lie 

205 210 215 

GAC AAC ATT TTC CGA TTC ACC CAG GCC GGT TCT GAG GTG TCT GCC TTG 200 

Asp Asn He Phe Arg Phe Thr Gin Ala Gly Ser Glu Val Ser Ala Leu 
220 225 230 

CTT GGT CGA ATT CCC TCC GCC GTC GGA TAC CAG CCC ACT CTT TCC ACC 24 8 

Leu Gly Arg He Pro Ser Ala Val Gly Tyr Gin Pro Thr Leu Ser Thr 
235 240 245 

GAT ATG GGA GGT ATG CAG GAG CGA ATT ACC ACC ACC AAG AAG GGA TCC 2 96 

Asp Met Gly Gly Met Gin Glu Arg He Thr Thr Thr Lys Lys Gly Ser 
250 255 260 

ATC ACT TCC GTC CAG GCC GTC TAC GTG CCT GCT GAT GAT TTG ACC GAT 34 4 

He Thr Ser Val Gin Ala Val Tyr Val Pro Ala Asp Asp Leu Thr Asp 
265 270 275 280 

CCT GCC CCC GCC ACC ACC TTC GCC CAC TTG GAC GCC ACC ACT GTG TTG 392 

Pro Ala Pro Ala Thr Thr Phe Ala His Leu Asp Ala Thr Thr Val Leu 

285 290 295 

TCT CGA GGT ATC GCT GAG TTG GGT ATC TAC CCC GCT GTC GAT CCC CTT 4 40 

Ser Arg Gly He Ala Glu Leu Gly He Tyr Pro Ala Val Asp Pro Leu 
300 305 310 

GAT TCT AAG TCC CGA ATG CTC GAC CCC CGA ATT GTC GGA CAG GAG CAC 4 88 

Asp Ser Lys Ser Arg Met Leu Asp Pro Arg He Val Gly Gin Glu His 
315 320 325 

TAC GAC ATC GCC ACC AAG ACC CAG AAG ATC CTC CAG GAC TAC AAG TCC 536 

Tyr Asp He Ala Thr Lys Thr Gin Lys He Leu Gin Asp Tyr Lys Ser 
330 335 340 

CTC CAG GAT ATC ATT GCC ATT CTT GGT ATG GAT GAG TTG TCT GAG GAG 584 

Leu Gin Asp He He Ala He Leu Gly Met Asp Glu Leu Ser Glu Glu 
345 350 355 360 

GAC AAG TTG ACC GTC GAG CGA GCC CGA AAG ATC CAG CGA TTC ATG TCG 632 

Asp Lys Leu Thr Val Glu Arg Ala Arg Lys He Gin Arg Phe Met Ser 

365 370 375 

CAG CCT TTC GCT GTC GCT CAG GTC TTC ACT GGT ATC GAG GGA AAG CTT 680 

Gin Pro Phe Ala Val Ala Gin Val Phe Thr Gly He Glu Gly Lys Leu 
380 385 390 

GTT CCC TTG AAG ACT ACT TTG GAG TCC TTT AAG GAG CTT CTT TCC GGA 728 

Val Pro Leu Lys Thr Thr Leu Glu Ser Phe Lys Glu Leu Leu Ser Gly 
395 400 405 

GCC TGC GAC CAC CTC CCT GAG TCT GCT TTC TAC ATG GTT GGT GAC ATC 776 

Ala Cys Asp His Leu Pro Glu Ser Ala Phe Tyr Met Val Gly Asp He 
410 415 420 

GCT GAT GTC AAG GCC AAG GCT GCT GCC CAG GCT AAG GAG TTG GCT GCT 824 

Ala Asp Val Lys Ala Lys Ala Ala Ala Gin Ala Lys Glu Leu Ala Ala 
425 430 435 440 

TAAGAGAAGA GTTGTCGAAT GTGTTTCGAG GTGTCAGAGT TGTCTTTTAT GAATGTTTCT 884 

ATCTCCTTAA AAAAAAAAAA AAAAAAAAAA 914 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Asn Glu Pro Pro Gly Ala Arg Ala Arg Val Ala Leu Thr Gly Leu 
15 10 15 

Thr lie Ala Glu Tyr Phe Arg Asp Glu Glu Gly Gin Asp Val Leu Leu 
20 25 30 

Phe He Asp Asn He Phe Arg Phe Thr Gin Ala Gly Ser Glu Val Ser 
35 40 45 

Ala Leu Leu Gly Arg He Pro Ser Ala Val Gly Tyr Gin Pro Thr Leu 
50 55 60 

Ser Thr Asp Met Gly Gly Met Gin Glu Arg He Thr Thr Thr Lys Lys 
65 70 75 80 

Gly Ser He Thr Ser Val Gin Ala Val Tyr Val Pro Ala Asp Asp Leu 
85 90 95 

Thr Asp Pro Ala Pro Ala Thr Thr Phe Ala His Leu Asp Ala Thr Thr 
100 105 110 

Val Leu Ser Arg Gly He Ala Glu Leu Gly He Tyr Pro Ala Val Asp 
115 120 125 

Pro Leu Asp Ser Lys Ser Arg Met Leu Asp Pro Arg He Val Gly Gin 
130 135 140 

Glu His Tyr Asp He Ala Thr Lys Thr Gin Lys He Leu Gin Asp Tyr 
145 150 155 160 

Lys Ser Leu Gin Asp He He Ala He Leu Gly Met Asp Glu Leu Ser 
165 170 175 

Glu Glu Asp Lys Leu Thr Val Glu Arg Ala Arg Lys He Gin Arg Phe 
180 185 190 

Met Ser Gin Pro Phe Ala Val Ala Gin Val Phe Thr Gly He Glu Gly 
195 200 205 

Lys Leu Val Pro Leu Lys Thr Thr Leu Glu Ser Phe Lys Glu Leu Leu 
210 215 220 

Ser Gly Ala Cys Asp His Leu Pro Glu Ser Ala Phe Tyr Met Val Gly 
225 230 235 240 

Asp He Ala Asp Val Lys Ala Lys Ala Ala Ala Gin Ala Lys Glu Leu 
245 250 255 

1 a ^Vl a 
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(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichoderma reesei 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 50. .361 

(D) OTHER INFORMATION: /partial 
/codon_start= 50 

/product- "ATPase subunit, partial sequence" 
/gene= "ATP 2 " 

/standard_name= "beta subunit of Fl portion of the 
F0F1 ATPase" 
/label= beta-subunit 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TACTCGAAGA ATTCGGCACG AGGCTGATTG CTCTCGGTCA TCTGCCAAG ATG TTC 55 

Met Phe 
260 

AAG AGC GGC GTT TCG TCC CTC GCC AGG GCT GCC CGC CCA TCA ATT ACC 103 
Lys Ser Gly Val Ser Ser Leu Ala Arg Ala Ala Arg Pro Ser lie Thr 
265 270 275 

GCT CGA CGA GCT ATC CGA CCA GCC TTC CCT CGA ACC CCC CTC GCG AGG 151 
Ala Arg Arg Ala lie Arg Pro Ala Phe Pro Arg Thr Pro Leu Ala Arg 
280 285 290 

CTT GCC AGC ACC CAG AGC GTC GGA GAT GGC AAG ATC CAC CAG GTC ATT 199 
Leu Ala Ser Thr Gin Ser Val Gly Asp Gly Lys He His Gin Val He 
295 300 305 

GGT GCC GTC GTC GAC GTC AAG TTC GAC ACC GCC AAG CTG CCT CCT ATC 247 
Gly Ala Val Val Asp Val Lys Phe Asp Thr Ala Lys Leu Pro Pro He 
310 315 320 

CTG AAC GCC CTG GAG ACC ACC AAC AAC AAC CAG AAG CTG GTC CTC GAG 295 
Leu Asn Ala Leu Glu Thr Thr Asn Asn Asn Gin Lys Leu Val Leu Glu 
325 330 335 340 

GTG GCT CAA CAC TTG GGC GAG AAT GTC GTT CGC TGC ATT GCC ATG GAC 343 
Val Ala Gin His Leu Gly Glu Asn Val Val Arg Cys He Ala Met Asp 
345 350 355 

GGA TCC GAG GGT CTC GTC GTGGTTCCAA GGCA 375 
Gly Ser Glu Gly Leu Val 
360 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Phe Lys Ser Gly Val Ser Ser Leu Ala Arg Ala Ala Arg Pro Ser 
15 10 15 

lie Thr Ala Arg Arg Ala He Arg Pro Ala Phe Pro Arg Thr Pro Leu 
20 25 30 

Ala Arg Leu Ala Ser Thr Gin Ser Val Gly Asp Gly Lys He His Gin 
35 40 45 

Val He Gly Ala Val Val Asp Val Lys Phe Asp Thr Ala Lys Leu Pro 
50 55 60 

Pro He Leu Asn Ala Leu Glu Thr Thr Asn Asn Asn Gin Lys Leu Val 
65 70 75 80 

Leu Glu Val Ala Gin His Leu Gly Glu Asn Val Val Arg Cys He Ala 
85 90 95 

Met Asp Gly Ser Glu Gly Leu Val 
100 



WO 98/10089 



64 



PCT/DK97/00373 



PATENT CLAIMS 

1. A method of improving the production of biomass or a 
desired product from a cell, characterized by expressing 

5 an uncoupled ATPase activity in said cell to induce con- 
version of ATP to ADP without primary effects on other 
cellular metabolites or functions, and incubating the 
cell with a suitable substrate to produce said biomass or 
product • 

2 . A method according to claim 1 , characterized by ex- 
pressing in said cell the soluble part (Fi) of the mem- 
brane bound (FqFi type) H + -ATPase or a portion of Fi ex- 
hibiting ATPase activity. 

3* A method according to claim 1 or 2 , wherein said cell 
is a prokaryotic cell . 

4. A method according to claim 3, wherein said cell is 
selected from the group consisting of bacteria belonging 
to the genera Lactococcus, Streptococcus, Enterococcus , 
Lactobacillus, Leuconostoc, EschBrichla, Zymomonas , Ba- 
cillus and Pseudomonas. 

5. A method according to claim 1 or 2 , wherein said cell 
is a eukaryotic cell. 

6. A method according to claim 5, wherein said cell is a 
yeast cell. 

7. A method according to claim 6, wherein said cell be- 
longs to Saccharomyces cerevlslae or Trichoderma reesei. 

8. A method according to any one of claims 1-7, wherein 
said cell is transformed or transfected with an expres- 
sion vector including DNA encoding F^ or a portion 
thereof exhibiting ATPase activity under the control of a 
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promoter functioning in said cell, and said DNA is ex- 
pressed in the cell . 

9. A method according to claim 8, wherein said DNA en- 
5 coding Fx or a portion thereof is homologous to said 

cell. 

10. A method according to claim 8, wherein said DNA en- 
coding Fx or a portion thereof is heterologous to said 

10 cell. 

11. A method according to any one of claims 8-10, 
wherein said DNA encoding Fx or a portion thereof is de- 
rived from a prokaryotic organism. 

15 

12. A method according to claim 11, wherein said DNA en- 
coding Fx or a portion thereof is derived from Esche- 
richia coli f Lactococcus lactls or Streptococcus thermo- 
phllus and is selected from the group consisting of the 

20 gene encoding the Fx subunit P or a portion thereof and 
various combinations of said gene or portion with the 
genes encoding the Fx subunits 5, a, y and e or portions 
thereof . 

25 13. A method according to claim 12, wherein said DNA en- 
coding Fx or a portion thereof is selected from the group 
consisting of the Escherichia coli. Streptococcus thermo- 
philus and Lactococcus lactls genes a tpHAGDC (coding for 
subunits 6, a, y, P, e) , atpAGDC (coding for subunits a, 

30 y, P, e), atpAGD (coding for subunits a, y, P) , atpDC 
(coding for subunits p, e) and atpD (coding for subunit p 
alone) . 

14. A method according to any one of claims 8-10, 
35 wherein said DNA encoding Fx or a portion thereof is de- 
rived from a eukaryotic organism. 



WO 98/10089 



66 



PCT/DK97/00373 



15. A method according to claim 14, wherein said DNA en- 
coding Fi or a portion thereof is derived from Saccharo- 
myces cerevislae, Phaffia rhodozyma or Trichoderma reesei 
and is selected from the group consisting of the gene en- 

5 coding the Fi subunit p or a portion thereof and various 
combinations of said gene or portion with the genes en- 
coding the other Fx subunits or portions thereof* 

16. A vector including DNA encoding the soluble part 
10 (Fi) of the membrane bound (FqFi type) H + -ATPase or a 

portion of Fi exhibiting ATPase activity, said DNA being 
derived from Lactococcus lactls subsp. cremorls and hav- 
ing the sequence stated in SEQ ID No, 1. 

15 17. a vector including DNA encoding the soluble part 
(Fi) of the membrane bound (FoFi type) H + -ATPase or a 
portion of Fi exhibiting ATPase activity, said DNA being 
derived from Lactococcus lactls subsp. lactls and having 
the sequence stated in SEQ ID No. 6. 

20 

18. A vector including DNA encoding the soluble part 
( Fi ) of the membrane bound ( FqFi type ) H + -ATPase or a 
portion of Fi exhibiting ATPase activity, said DNA being 
derived from Str&ptococcus thermophilics and having the 

25 sequence stated in SEQ ID No. 10. 

19 . A vector including DNA encoding the soluble part 
(Fi) of the membrane bound (F()Fi type) H + - ATPase or a 
portion of Fi exhibiting ATPase activity, said DNA being 

30 derived from Phaffia rhodozyma and having the sequence 
stated in SEQ ID No. 14. 

20. A vector including DNA encoding the soluble part 
(Fi) of the membrane bound (FoFi type) H + - ATPase or a 

35 portion of Fi exhibiting ATPase activity, said DNA being 
derived from Trichoderma reesei and having the sequence 
stated in SEQ ID No. 16. 
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21 ♦ An expression vector including DNA as defined in any 
one of claims 16-20 under the control of a promoter capa- 
ble of directing the expression of said DNA in a prokary- 
otic or eukaryotic cell. 

22 • A method of optimizing the formation of biomass or a 
desired product by a cell, c har ac ter i zed by expressing 
different levels of uncoupled ATPase activity in the 
cell, incubating the cell on a suitable substrate, meas- 
uring the conversion rate of substrate into biomass or 
the desired product at each level of ATPase expression, 
and choosing a level of ATPase expression at which the 
conversion rate is optimized. 

23. A method according to claim 22, wherein a number of 
specimens of said cell are transformed or trans fee ted 
with their respective expression vector each including 
DNA encoding a different portion of the cytoplasmic part 
(Fi) of the membrane bound (F()Fi type) H + -ATPase up to 
and including the entire Fi, each portion exhibiting ATP- 
ase activity, said DNA in each expression vector being 
under the control of a promoter functioning in said cell, 
incubating each cell specimen on a suitable substrate, 
measuring the conversion rate of substrate into biomass 
or the desired product by each specimen, and choosing a 
specimen yielding an optimized conversion rate. 

24. A method according to claim 22, wherein a number of 
specimens of said cell are transformed or trans fected 
with their respective expression vector including DNA en- 
coding a portion of the cytoplasmic part (Fi) of the mem- 
brane bound (FqFi type) H + -ATPase up to and including the 
entire F^, said portion exhibiting ATPase activity, said 
DNA in the respective expression vectors being under the 
control of each of a series of promoters covering a broad 
range of promoter activities and functioning in said 
cell, incubating each cell specimen on a suitable sub- 
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strate, measuring the conversion rate of substrate into 
biomass or the desired product by each specimen, and 
choosing a specimen yielding an optimized conversion 
rate. 

5 

25. A method according to claim 24, wherein the respec- 
tive expression vectors include DNA encoding different 
such portions of Fi up to and including the entire Fi, 
each DNA in respective expression vectors being under the 

10 control of each of a series of promoters covering a broad 
range of promoter activities and functioning in said 
cell. 

26. A method according to any one of claims 23-25, 
15 wherein the promoter in each expression vector is an in- 
ducible promoter, and each cell specimen is grown at dif- 
ferent concentrations of inducer. 

27. A method according to any one of claims 23-26, 
20 wherein said DNA encoding a portion of Fi up to and in- 
cluding the entire Fi is homologous to said cell. 

28. A method according to any one of claims 23-26, 
wherein said DNA encoding a portion of Fi up to and in- 

25 eluding the entire Fi is heterologous to said cell. 

29. A method according to any one of claims 23-28, 
wherein said DNA encoding a portion of Fi up to and in- 
cluding the entire Fi is derived from a prokaryotic or- 

30 ganism . 

30. A method according to claim 29, wherein said DNA en- 
coding a portion of Fi up to and including the entire Fi 
is derived from Escherichia coll, Lactococcus lactis or 

35 Streptococcus thermophilus and is selected from the group 
consisting of the gene encoding the Fi subunit p or a 
portion thereof and various combinations of said gene or 
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portion with the genes encoding the Fi subunits 8, a, y 
and e or portions thereof - 

31. A method according to claim 30, wherein said DNA en- 
5 coding a portion of Fi up to and including the entire Fi 

is selected from the group consisting of the E. coli 
genes atpAGDC (coding for subunits a, y 9 P, e) , atpAGD 
(coding for subunits a, y, P), atpDC (coding for subunits 
p, e) and atpD (coding for subunit P alone). 

10 

32. A method according to any one of claims 23-28, 
wherein said DNA encoding a portion of Fi up to and in- 
cluding the entire Fi is derived from a eukaryotic organ- 
ism* 

15 

33. A method according to claim 32, wherein said DNA en- 
coding Fi or a portion thereof is derived from Saccharo- 
myces cerevisiae, Phaffia rhodozyma or Trichoderma reesei 
and is selected from the group consisting of the gene en- 

20 coding the Fi subunit P or a portion thereof and various 
combinations of said gene or portion with the genes en- 
coding the other Fi subunits or portions thereof. 
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